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ABSTRACT 


The  five  technical  working  papers  that  compose  this  document  (which  appears  in 
two  volumes)  were  prepared  as  part  of  the  Dialogue  Modeling  Project  at  ISI.  Though 
diverse  in  scope,  all  are  related  to  the  problem  of  creating  a valid  process  model  of  human 
communication  in  dialogue.  All  are  unpublished  and  all  but  one  are  in  a form  intended  for 
internal  use  by  the  project  team;  however,  they  are  of  interest  beyond  the  boundaries  of 
the  project  and  have  implications  for  related  work  in  modeling  human  communication. 

In  Volume  1 both  papers  are  on  reference  as  a phenomenon  in  text.  The  first 
surveys  reference  identification  and  resolution  methods  in  various  existing  natural 
language  processors.  The  other  paper  explores  the  broader  problem  of  reference, 
focusing  on  text  reference  and  propositional  reference.  It  develops  problems  and 
proposals  for  defining  these  categories  of  reference  phenomena  and  for  detecting 
instances  of  them. 

In  Volume  2 the  first  paper  concerns  study  methodology.  It  raises  some  of  the 
following  issues;  how  to  choose  between  system-building  and  process-building,  why 
studying  cases  is  preferable  to  implementing  general  language-use  functions  as  programs, 
how  io  control  ad-hocness  of  results,  why  it  is  important  to  orient  toward  communication 
phenomena  (in  contrast  to  form  phenomena)  when  studying  natural  language.  The  second 
is  a design  paper  on  the  Match  process  of  the  Dialogue  Model  System,  exploring  methods 
for  making  it  efficient  and  selective  in  its  actions.  The  third  concerns  the  structure  o 
persuasion  dialogues,  in  particular  how  justification  of  actions  appears  in  argumentation. 

Each  working  paper  appears  with  its  original  abstract  or  introduction. 
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One  of  the  challenges  in  modeling  human  language  capability  is  the  fact  that 
words  are  used  to  convey  different  meanings  in  different  contexts.  For  example,  the 
meaning  of  a pronoun  is  heavily  dependent  on  the  context  in  which  it  occurs,  since  it  is 
used  to  refer  to  some  entity  mentioned  elsewhere  and  has  the  same  referent  as  this 
othbt  expression.  The  meaning  of  instances  of  "it"  may  change  radically  from  occurence 
to  occuronce,  even  within  the  same  utterance.  In  the  following  portion  of  dialogue  » : 

"0:  Did  Runoff  produce  any  output? 

L:  I doiit  Know  but  I thought  it  would  ask  me  what  name  it  would  be." 

In  the  sec  and  utterance,  the  two  “it“s  refer  to  completely  different  things  expressed  in 
the  first  uilerance,  the  first  to  "Runoff"  and  the  second  to  "output". 

Many  of  the  process  models  of  languagt*  understanding  have  dealt  with  this  issue 
of  repeated  reference,  in  which  one  expression  (often  containing  a pronoun  or  a 
determiner)  derives  meaning  from  some  other  expression  in  the  discourse.  Generally, 
these  models  contain  a set  of  heuristic  rules  for  ordering  possible  co-referential 
expressions  and  for  testing  them  for  appropriateness.  For  these  models,  the  rules  form 
a "bag  of  tricks"  approach  to  repeated  reference,  rather  than  a systematic  theory. 

In  this  paper,  we  will  examine  in  detail  several  models  of  language  understanding 
that  deal  with  repeated  reference.  After  examining  the  operation  of  these  models  on  a 
simple  dialogue,  we  will  classify  the  assembled  "bag  of  repeated  reference  tricks"  into 
two  categories,  and  from  this  classification,  propose  a new  approach  for  the  process 
modeling  of  repeated  reference. 


I.  Ht^fwaU’d  Krfcroncc 

Repeated  Reference  occurs  when  two  sets  of  words  in  a discourse  refer  to  the 
same  concept.  Let  us  call  a set  of  words  in  an  utterance  defined  as  a unit  on  syntactic 
grounds  an  Expression.  A Co-referential  Expression  is  an  Expression  that  has  the  same 
referent  as  some  other  Expression.  Not  all  E;.prc5sions  are  Co-referential  Expressions. 
Sometimes  definite  noun  phrases  are  used  to  refer  to  generic  concepts  ("Co  you  know 
how  the  mail  syntcm  works?").  Other  limes,  an  Expression  can  introduce  a unique 
concept  {"...thr  IS!  linn  primnr  Or  a concept  that  is  unique  in  the  given  situation  ("I 
just  used  a system  called  XOFF  and  it  didn’t  give  me  ihr  normal  nuiimi").  Other 
Expressions  in  fact  don’t  refer  to  anything  CIt's  5 oclock.”). 

It  is  important  to  distinguish  between  Expressions  and  Concepts.  Expressions  are 
sets  of  words,  while  Concepts  are  the  abstract  entities  which  are  the  referents  of 
Expressions  (as  well  as  leferents  of  other  non-verbal  stimuli). 


: * Unless  otherwise  specified,  all  examples  in  this  paper  are  taken  frorr,  naturally 

■"  occurring  dialogues  collected  by  the  Dialogue  Modeling  Project  at  ISI.  These  dialogues 

are  between  a computer  operator  (labeled  "0")  and  a computer  user  ("L").  The 
^ participants  communicated  remotely  by  typing  into  computer  terminals  using  the  TENEX 

I "link"  facility,  whatever  either  person  types  appears  simultaneously  on  both  terminals. 

is'' 

1$ 

K 
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We  can  easily  detect  Expressions  because  they  are  defined  on  a syntactic  basis. 
In  fact,  we  could  use  one  of  the  existing  parsing  systems  to  mechanically  detect 
Expressions.  However,  this  isn’t  true  for  Co-referential  Expressions.  Some  people 
(Baranofsky,  1970;  OIney,  1969)  have  investigated  the  detection  of  Co-referential 
Expressions  (sometimes  called  "anaphoric  expressions")  using  certain  syntactic  aspects 
to  distinguish  the  Co-referential  Expressions  from  non-repeated  ones.  The  language 
understanding  systems  described  here  all  approach  this  detection  problem  by  delaying 
the  decision  until  it  is  trivial.  They  all  look  for  preexisting  referents  for  all  Expressions, 
and  those  that  have  referents  that  are  also  the  referents  of  other  Expressions  are  then 
Co-referential  Expressions. 


2.  Repented  Reference  in  Existing  Systems 

We  will  now  concentrate  on  the  heuristics  for  finding  an  existing  referent,  given 
an  Expression.  The  operation  of  some  of  these  will  be  illustrated  on  a simple  dialogue, 
shown  in  figure  1.  This  is  a real  dialogue  between  a computer  user  and  a computer 
operator  typing  over  computer  terminals  using  the  "link"  facility. 


4 


LINK  FROn  (U,  TTY  42 

1.  1 2 3 4 5 R 7 8 9 10  11  12  13  14  15 

L;  HDU  DO  I GET  RUNOFF  TO  UORK,  I KEEP  KEQTN  IT  BUT  IT  JUST  GRABS 

1.  IG  17  18  19  20  21  22  23  24  25  2G  27  28 

MY  INPUT  FILE  ANO  THEN  SAYS  OONE  BUT  GIVES  NE  NO  OUTPUT?  GA 


2.  1 2 345G7  8 

Os  THE  OUTPUT  CONES  OUT  ON  THE  LINE  PRINTER 


3.  1 2 3 4 5 G 7 8 9 10  11  12  13  14 

L:  THROU  IT  AUAY  BUT  CAN  I GET  IT  TO  GE\EO  TO  A FILE?  GA 


4.  1 2 3 4 5 G 7 8 9 10  11  12 

Os  CONFIRN  YOUR  COnilANOS  WITH  A COnMA  AND  YOU’LL  BE  QUERIED  FOR  FILES, 

4.  13  14 

ETC.  GA 


5.  1 2 

Ls  THX  nUCHO 


BREAK  (LINKS) 


Figure  1:  Dialogue  between  a TENEX  Operator  ("0")  and  a User  ("L") 

(The  numbers  and  speaker  labels  have  been  added). 
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Our  Dialogue  Modeling  Project  at  ISI  has  developed  a procedure  to  gather 
annotations  from  observers  of  a dialogue  transcript  to  use  in  evaluating  dialogue 
models.  Figure  2 shows  the  annotation  produced  by  one  observer  of  repeated 
reference  phenomena  in  the  dialogue  in  figure  1,  using  our  repeated  reference 
instructions  (given  in  Mann,  Moore,  Levin  & Carlisle,  1975).  We  will  use  these 
observations  to  evaluate  the  various  repeated  reference  heuristics. 

2.1  Kcpeatfld  Reference  in  Verh  world 

Verbworld  (Rumelhart  ^ Levin,  1975)  is  a recent  language  understanding  system 
that  deals  with  repeated  reference  in  some  detail. 

There  are  two  parts  of  this  system  where  repeated  reference  is  handled;  the 
heuristics  for  handling  pronoun  reference  (developed  by  /^rt  Graesser)  and  the 
heuristics  for  handling  definite  determiners  (developed  by  Donalo  Norman). 

Pronouns 

Third  person  pronouns  (HE,  SHE,  IT,  THEY,  THEM,  HIM,  HER): 

1.  Look  in  the  previous  clauses  for  uses  of  this  same  pronoun. 

If  one  is  found,  it  is  a Co-referential  Expression. 

else 

2.  Examine  the  noun  phrases  in  the  previous  clause, 

looking  for  a concept  that  matches  the  number  and  gender  constraints. 

Examine  each  clause  in  the  following  order;  subject,  object, 
prepositional  phrases.  If  a match  is  found,  if  is  a Co-referential 
Expression. 

else 

3.  Start  over  with  step  2. 

Reflexive  pronouns:  (HIMSELF,  HERSELF,  ITSELF,  THEMSELVES) 

Examine  previous  noun  phrases  in  this  same  clause  for  a concept  with 
the  appropriate  gender  and  number.  If  there  is  one,  the  phrase  is  a 
Co-referential  Expression. 

Possessive  pronouns:  (HIS,  HER,  ITS,  THEIR) 

1.  Examine  noun  phrases  within  the  sentence,  as  in  the  case 
of  reflexive  pronouns.  If  there  is  a match,  it  is  a Co-referential 
Expression. 

else 

2.  Examine  noun  phrases  in  previous  clauses,  as  in  the  case  of  ■ lird  person  pronouns. 


Determiners 

1.  Evaluate  the  noun  phrase.  If  the  value  is  a unique  concept,  then 
that  value  is  the  referent. 

else 

2.  Evaluate  any  relative  clauses  in  the  noun  phrase,  jnd  if  this 
results  in  an  unique  instance,  this  instance  is  the  referent. 

else 


■t  ;,.twwffcwoi 


) riput  file  and  then  says  done  but  gives  tne  no  output?  Go  ahead 

-X ^ J -T~ 

V (3) 


I;  The  output  comes  out  on  the  line  printer 


Throu  ijt  auay  but  can  I gat  it  to  go  to  a file?  Go  ahead 

J P 7 

0;  Confirm  your  commands  uith  a comma  and  you’ll  be  queried  for  files, 

© izr 

etc.  Go  ahead 


L:  Thanks  much 


BREAK  (LINKS) 


Figure  2:  Dialogue  annotated  for  Repeated  Reference 
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there  k?*  clauses 

and  If  there  is  a match  with  the  features  of  the  rest  of  the 

noun  phrase,  that  concept  is  the  referent.  In  particular, 

f an  exemplar  of  the  general  concept  was  recently  mentioned, 
that  exemplar  is  the  referent.  enuoneo, 

else 

f in  an  indefinite  sense, 

and  create  a new  instance  as  the  referent. 


Observer  of  . ^;!i«  observer  of  it.  To  enable  it  to  operate  as  the 

heuristics-  the  fi  f modification  to  the  above 

present  speaker  thr's^o"  th! 

participant  af, be,,  ,e,ere7 


1. 


Let  us  examine  a hand-simulated  operation  of  Verbworld  on  the  dialogue  in  figure 


- — •« 

E.p,eslrbec7„se  a':';it‘a‘;e'r^^^  '"“""r  ^ 

pravioce  Cause  T isn't  neuter  and'  <7,  ^ 

co-re, ;::n,?ar<a"nd,trel%'',"uS^  ‘ " i'  as 


2). 


Both  0,  these  assisnments  correspond  to  the  annotations  by  the  observer  (figure 


s,?££SL“5”-‘" 

l'uli7/rol^VM2)th?re^ 

Kind output,  'Jf,d';Vcr,:a:ntr,^'E:“;sdo^  » 

In  turn  three,  the  pronoun  heuristics  also  run  into  frouhip  Th^  »;♦»  t r,  r,  • 

XI  :ri:  :t‘sf ^ sbi"?,be 

ne  generic  class,  and  it  (3.2)  as  an  instance  of  that  class. 

Furthermore,  the  second  "it”  (3.8)  is  assipnpd  the  fir«:t  "It"  /'j  o\  / »■  , 

The  Observer  distinguishes  ,t»se  as  ^tterentrL  we  ^can  easily  s^e^^rthrsemtnl;: 
(You  can  t ttirow  something  away  and  then  get  that  same  thing  to  go  to  a file). 

indefinS'-a’flle'  a^Tn'  ^ ^ superset  of  the 

definite  a file  at  3.13.  The  Verbworld  system  would  also  make  this  assignment 

because  of  the  way  the  indefinite  article  "a"  is  defined.  ^ ’ 
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2.2  Roficntci  Reforonco  in  SIIRI)!.U 

One  of  the  most  impressive  language  understanding  systems  to  date  is  SHRDLU, 
developed  by  Winograd  (1972).  This  system  keeps  track  of  "overall  discourse",  and 
looks  in  this  overall  discourse  context  for  referent  concepts.  Winograd  implemented  a 
set  of  heuristics  for  repeated  reference,  most  of  which  are  summarized  below: 


General  Tricks  for  Determiners 

1.  Find  the  set  of  all  known  objects  that  match  the  rest 
of  the  noun  phrase. 

then 

2.  If  the  right  number  are  known,  the  object  or  the  set  of  objects 
is  the  referent. 

else 

3.  If  there  are  too  few  objects,  try  to  reparse  the  sentence  (if 
this  fails,  print  out  a stored  phrase  asking  for  what  the  person 
meant). 

or 

4.  If  there  are  too  many,  try  to  find  which  were  mentioned  most 
recently,  (if  the  right  set  of  objects  cannot  be  found, 

try  reparsing  as  above,  but  with  different  error  message  about 
which  were  meant.) 


General  Tricks  for  Pronouns  ("IT") 

1.  If  there  is  another  "IT"  previously  in  the  same  sentence, 
it  is  a Co-referential  Expression. 

else 

2.  If  there  is  another  "IT"  in  the  previous  sentence, 
it  is  a Co-referential  Expression. 

else 

3.  For  complex  embedded  NP’s,  check  whether  the  "IT" 
is  a reference  to  the  NP  it  is  in. 

else 

4.  Look  through  previous  clauses,  looking  for  possible  co-referential  expressions, 
assigning  plausibilities  on  the  basis  of  the  following: 

a.  Subject  > Object  > Prep  Phrases 

b.  Main  clause  > Subordinate  clause 

c.  "Focused"  objects  > non-focused 

Choose  the  most  plausible  matching  concept  as  the  referent. 


Special  Case  Tricks 

IT:  if  used  as  propositional  reference,  the  most  recent  action  mentioned 
by  the  other  participant  is  the  referent. 

THAT:  if  used  as  propositional  reference,  the  most  recent  action  mentioned 
by  either  participant  is  the  referent. 
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I;  tho  roforont  is  :FRIEND 
YOU:  the  referent  is  :SHRDLU 

Now,  despite  the  differences  in  SHRDLU’s  heuristics  from  those  of  Verbworld,  it 
would  produce  the  same  referent  resolutions  on  the  figure  1 dialogue,  incorrectly 
assigning  the  same  referent  to  the  "it"s  in  turn  3 and  running  into  problems  assigning 
"the  output"  in  turn  2. 

Why  do  these  systems  do  so  poorly  on  this  simple  dialogue?  The  Verbworld 
heuristics,  for  example,  correctly  found  co-referential  expressions  for  pronouns  in  907 
of  randomly  chosen  text  from  an  encyclopedia  (Rumelharf  & Levin,  1975).  This 
disparity  in  performance  gives  us  a hint  of  what  the  problem  is.  The  encyclopedia 
contains  only  well-formed  grammatical  sentences,  while  the  dialogue  in  figure  1 contains 
many  ill-formed  utterances,  as  is  typical  of  real  dialogue.  Since  most  of  the  heuristics 
discussed  so  far  are  based  mostly  on  surface  syntactic  features,  its  not  surprising  that 
they  do  poorly  on  the  syntactically  ill-formed  utterances  in  real  dialogue. 

Now,  this  shouldn’t  be  taken  as  a claim  that  syntactically  based  heuristics  are 
useless  - only  that  there  must  be  additional  heuristics  using  semantic  and  pragmatic 
features  that  contribute  to  determining  that  Expressions  are  Co-referential. 


Rcpcnlrd  Rafcronr.c  in  S(^mnnticnlly  Oricnicd  SyniemK 

Some  systems  find  the  referents  for  Expressions  solely  on  the  basis  of  semantic 
features,  completely  ignoring  syntactic  aspects.  Quillian  (1969)  described  such  a 
system,  and  Reiger  (1974)  used  a similar  approach  to  do  the  repeated  referencing  for 
definite  noun  phrases  within  the  MARGIE  System  (Schank,  Goldman,  Rieger  & Riesbeck, 
1973) 


In  this  approach,  a set  of  specifications  for  the  referent  are  collected  and  a match 
process  is  conducted  over  the  set  of  concepts  in  the  current  context  (concepts  that 
have  been  recently  mentioned  or  used  recently  in  some  inference).  A concept  in 
context  that  matches  the  specifications  is  then  a prime  candidate  for  being  the  referent 
of  the  Expression. 

Let  us  look  at  how  a semantically  based  repeated  reference  system  would  deal 
with  the  dialogue  in  figure  1.  As  with  the  syntactic  system,  it  handles  the  personal 
pronouns,  given  the  modification  to  make  the  present  speaker  a specification  of  the 
referent  of  the  pronoun  "I",  and  the  other  participant  a specification  of  the  referent  of 
"you". 


For  the  "it"  at  1.11,  the  partial  specification  of  this  referent  is  a non-human  thing 
that  can  be  XEQTed.  ("XEQT"  is  an  abbreviation  for  "execute",  which  is  the  action  of 
starting  a computer  program).  Given  that  XEQT  was  known  to  be  an  action  applicable 
to  programs,  and  that  RUNOFF  is  known  to  be  a program,  this  semantic  approach  could 
find  RUNOFF  as  the  referent. 

There  is  a slight  problem  with  the  "it”  at  1.13,  For  the  correct  referent  to  I 
found,  the  system  would  have  to  regard  RUNOFF  as  an  animate  agent,  since  the  refereiu 
is  specified  to  be  something  which  "grabs". 
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For  the  first  Expression  in  turn  2,  the  semantic  approach  has  the  same  problem  as 
discussed  for  the  syntactic  approach  here:  it  may  find  "no  output"  in  turn  1 as 
co-referential. 

The  second  Expression  "the  line  printer"  isn’t  a repeated  reference,  but  the 
semantic  approach  may  handle  this  correctly,  depending  on  the  exact  comprehension 
and  reference  processes  it  has.  If  the  Knowledge  by  both  participants  about  RUNOFF 
and  TENEX  systems  includes  line  printers,  and  if  this  knowledge  is  accessed  in 
understanding  the  dialogue  to  this  point,  then  the  correct  referent  for  this  Expression 
will  be  correctly  found,  even  though  it  hadn’t  been  mentioned  previously. 

In  turn  3,  the  semantic  system  may  see  “the  line  printer"  as  co-referential  to  "it" 
on  the  basis  of  pure  recency.  The  semantic  system  may  be  able  to  reject  this, 
depending  on  what  it  knows  about  line  printers  and  files.  But  this  is  a case  in  which 
syntactic  clues  would  help  a pure  semantic  system,  since  these  clues  would  suggest  "the 
output"  as  a better  possibility. 

There  are  cases  in  w.nich  ignoring  syntax  will  cause  a pure  semantic  system  to 
fail. 

"...  there  is  a system  that  you  can  use  that  will  let  you  manipulate  your  unsent  mail.  It 
is  called  mailstaf,  I believe.  ..." 

In  this  case,  there  are  least  two  possible  co-referential  expressions  to  "it",  "a  system" 
and  "your  unsenf  mail",  both  of  which  fit  the  constraints.  On  syntactic  grounds,  we  can 
see  that  "a  system"  is  the  co-referential  expression.  Clearly,  we  want  to  combine  all 
the  available  syntactic,  semantic,  and  pragmatic  cues  to  repeated  reference  into  one 
integrated  approach. 


.1.  Clnssificntiou  of  iho  ling  of  Krponird  Rpforonc.o  Tricks 

Let  us  look  at  a set  of  repeated  reference  tricks,  shown  in  figure  3.  These 
include  the  ones  we  have  discussed  already,  plus  additional  repeated  reference 
heuristics  described  in  papers  by  Warnock  (1972)  and  Baranofsky  (1970). 

We  can  classify  the  bag  of  repeated  reference  tricks  into  two  categories,  aspects 
of  the  possible  Co-referential  Expressions  and  aspects  of  the  current  Expression. 
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Proximity:  The  closer  an  utterance  containing  another  Expression  is  to  the  current 
Expression,  the  more  likely  the  other  is  to  be  Co-referential  with  the  current  one. 
"I  can’t  find  any  documentation  on  the  ."'rogram,  I have  a tape  here  at  Purdue  and 
I can’t  figure  out  what  format  it's  in.” 

The  "it"  refers  to  the  tape,  not  to  the  documentation  or  to  the  program. 

Syntactic  role  m the  sentence: 

2a.  Subjecf/Object/Preposition  phrases:  The  syntactic  subject  of  an  utterance 

is  more  likely  to  be  an  Co-referential  Expression  than  the  syntactic  object,  which 
is  more  likely  than  the  preposition  phrases. 

"0:  The  output  comes  out  on  the  line  printer. 

L:  Throw  it  away  ...  " 

The  "it"  refers  to  the  output  (the  syntactic  subject  of  the  first  utterance)  rather 
than  to  the  line  printer  (the  syntactic  object). 

2b:  Superordinate/subordinate:  Concepts  expressed  in  a superordinate  clause 

are  more  likely  to  be  Co-referential  that  those  in  any  subordinate  clauses. 

" ...  the  tape  that  file  is  archived  on  seems  to  be  a bad  tape.  We  can’t  seem  to 
get  it  to  read  ...  " 

The  "it"  refers  to  the  tape  rather  than  to  the  file  (in  the  subordinate  relative 
clause). 

2c:  Topicalization:  Some  special  syntactic  constructions  (such  as  cleft 

sentences)  can  be  used  to  emphasize  one  element  of  an  utterance.  These 
emphasized  Expressions  are  more  likely  to  be  co-referential  than  unemphasized 
ones. 

" ...  there  is  a background  job  running  here  that  checks  to  see  if  there  is  any 
unsent  mail.  Once  it  finds  some,  it  tries  to  resend  it." 

The  two  "it"s  are  co-referential  with  the  lopicalized  "background  job"  rather  than 
with  the  "unsent  mail". 

Centrality:  an  concept  which  has  previously  been  referenced  more  than  once  is 

more  likely  to  be  referenced  again  than  one  referenced  only  once. 

"L:  ...  Any  chance  I can  recover  [file  name]  from  the  most  recent  system  dump? 

0:  Probably,  let  me  look  for  it  and  get  back  to  you,  ok? 

L:  Could  you  SNDMSG  to  me,  one  way  or  the  other?  I won’t  be  doing  anything 

about  it  tonight.  If  it  is  there,  I will  be  forever  grateful  to  recover  it.” 

The  "it"s  by  L all  refer  back  to  L’s  file,  rather  than  to  the  system  dump  or  the  one 
way  or  the  other,  at  least  partially  due  to  the  previous  reference  by  0. 

Current  topic:  An  Expression  which  refers  to  a concept  in  the  current  topic  is  more 
likely  to  be  Co-referential.  Deutsch  (1974)  observed  that  repeated  reference 
can  normally  be  made  only  to  concepts  that  are  part  of  a currently  open  topic. 
Once  a topic  is  closed,  It  must  be  reopened  before  concepts  within  it  can  be 
referenced  again. 
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the 


L:  ...  Can  you  recover  those  files  for  me..as  far  as  I know  they  were  in 

directory  on  the  16th.. .the  names  are  ... 

[ intervening  dialogue  ] 

0:  OK  I have  found  the  files  you  wont  I will  retrieve  those  for  you  ...  " 

In  the  second  utterance,  0 had  to  initially  specify  the  files  in  some  detail,  but  once 
the  topic  was  re-established,  she  could  use  just  "those". 


Figure  3a:  Aspects  of  Possible  Co-referential  Expressions 


13 


1.  Specification  by  the  current  Expression: 

la:  The  the  pronoun  or  determiner  in  an  Expression  often  specifies  that  the 

referent  concept  be  a particular  number  and/or  gender. 

"How  do  / get  Runoff  to  work?" 

The  pronoun  "I"  completely  specifies  the  referent  to  be  the  speaker. 

lb:  The  other  words  in  the  current  Expression  often  further  specify  the 

referent. 

"I  have  found  ihn  film  you  nro  oonrontod  nhoul  ... 

2.  Specification  by  the  verb  in  ft  same  utterance:  The  verb  in  the  same  utterance  as 

the  Expression  often  specifies  that  the  referent  have  certain  properties. 

"Throw  ii  away." 

The  verb  (and  verb  particle)  "throw  away"  specifies  the  referent  to  be  something 
of  little  value  to  the  speaker. 

3.  Specification  by  the  whole  clause  containing  the  current  expression:  Sometimes  the 

particular  combination  of  a verb  and  its  noun  phrase  arguments  puts  constraints 
on  the  referent. 

"Did  it  produce  any  output  file?" 

The  combination  of  the  verb  and  the  object  limit  the  referent  to  being  a computer 
program  of  some  kind. 


Figure  3b:  Aspects  of  the  Current  Expression 
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.1.1  The  Ponihlc  Co-referential  Exprennionn 

are  a divLl'e  co-referenf.al  expressions 

are  a diverse  lot.  covenn.  both  syntax  and  semantics.  One  thing  seems  to  be  a common 

feature  - each  seems  to  reflect  contributions  to  the  salience  of  the  various  concepts  in 

factTh  ?"n  ^°T  ° capture  what  is  called  "focus":  others  reflect  the 

refreshed  temporary,  disappearing  if  they  aren't  repeatedly 

.1.2  Aspeeis  of  the  Current  Kxpremon 

rar.  referent  of  an  Expression  is  completely  unspecified  . Each  of  the 

repeated  reference  heuristics  given  in  the  figure  3b  can  be  seen  as  contrihutinP 

ISdJ'nl"  t f unspecified  concept.  Each  heuristic  may  operate  independently  in 

the  na  expression.  And  the  end  result  is 

the  partially  specified  referent. 

This  classihcation  of  the  repeated  reference  heuristics  into  these  two  categories 
straightforward.  However,  it  suggests  a general  approach  toward  modeling  repeated 
reference  abi  ties  in  process  models.  There  are  two  parts  of  a model  - all  those 
processes  that  contribute  specifications  to  the  referent  of  the  current  exoression  and 
the  processes  that  affect  the  salience  of  all  the  other  currently  active  concepts. 


/ht  /IciivfUlott  Modi*!  of  Kppooted  Rpf orcttcc 

Let  us  assume  we  have  some  standard  parser,  that  takes  an  utterance  and  chunks 
II  up  into  Expressions. 

Let’s  put  each  of  these  units  into  one  place,  called  a Workspace,  and  give  it  a 
numeric  value,  called  its  Activation  Rate.  This  Activation  Rate  reflects  ihe  momentary 

niypir  u/K  """  " ""^portance"  metric  discussed  by  Warnock 

it:?  n " process  attempts  to  put  an  Activation  in  the 

workspace  (to  represent  some  new  intermediate  result),  the  Activation  Model  will  first 
look  for  an  identical  existing  activation.  If  none  exists,  the  Model  will  create  a new 

t however,  if  an  existing  Activation  is  found, 

the  Model  instead  increments  its  Activation  Rate  by  the  specified  amount. 

ratP.  it  existence  of  an  Activation,  Those  Activations  with 

rates  below  the  Threshold  are  removed  from  the  Workspace. 

Ratp  the  Activation  Model  is  a set  of  rules  for  modifying  the  Activation 

Rates,  derived  from  the  heuristics  in  figure  3a.  These  rules  are  given  below.  The 
ntents  of  the  Workspace  will  be  undergoing  continual  change,  with  new  activations 
existing  activations  being  deleted  whenever  they  fall  below 
hreshold.  , he  contents  at  any  one  moment  serves  as  the  current  context  for 
reference  resolution. 

Let  us  explore  the  operation  of  this  Activation  Model  of  Repeated  Reference  by 
hand-simula.ing  its  performance  on  the  dialogue  in  figure  1.  To  do  this,  we  first  have 
to  specify  a number  of  parameters  (in  a somewhat  ad  hoc  manner)  for  the  rules  given 


below.  The  first  part  of  each  rule  dcscriptiyn  covers  general  aspects,  and  the  second 
part  gives  a precise  specification  (with  ad  hoc  parameters)  that  will  be  used  to  show  the 
Operation  on  the  example  discourse. 


4.1  Rule.\  for  Modifying  the  /Ictivntion  Kales  of  Possible  referents 

1.  Recency  of  utterance  rule; 

As  each  new  utterance  comes  in,  decrement  the  rates  of  all  the  preexisting 
activations. 

For  the  example  below,  decrement  all  existing  activations  by  507. 

2.  Primacy  within  utterance  rule; 

Increment  the  activation  rate  of  the  concept  representing  each  unit  of  an 
utterance  as  it  enters,  but  by  a successively  smaller  amount.  A subclaim  of  this 
Model  is  that  the  three  separate  syntactic  factors  2a  (Subject/object/preposition 
phrase),  2b  (Super/subordinate  clauses),  and  2c  (Topicalized  constructions)  in 
figure  3a  can  be  captured  in  this  one  rule. 

For  the  example,  increment  the  activation  for  the  first  expression  by  10 
(arbitrary)  units,  the  next  by  9,  etc. 

3.  Centrality: 

When  a concept  has  been  referenced  more  than  once,  the  same  activation  will 
have  been  incremented  by  each  reference.  Thus  if  will  be  more  salient  than  if 
referenced  only  once,  and  so  more  likely  to  be  selected  again  as  a referent. 

The  centrality  aspect  is  already  captured  by  the  Activation  Model  itself,  so  we 
need  no  separate  rule. 

A.  Current  topic: 

Given  a comprehension  process  for  detecting  topic  structure,  a rule  that 
incremented  the  activation  rates  of  all  components  of  a newly  detected  topic 
would  capture  the  current  topic  aspect.  We  don’t  yet  have  a model  of  this 
process,  so  we  can’t  use  this  rule.  But  this  illustrates  the  way  that  progress  in 
modeling  other  aspects  of  natural  language  can  be  easily  interfaced  to  this 
Activation  Model  extending  its  capabilities. 

The  operator-linker  dialogues,  such  as  the  one  in  figure  1,  generally  contain  only 
one  topic  throughout,  so  the  present  lack  of  a topic  structuring  process  isn’t  very 
noticeable  for  reference  resolution  in  the  cases  we  have  been  considering  from 
these  dialogues. 

Now  what  do  we  do  with  an  Expression?  Welt,  we  treat  if  much  like  any  other  unit. 
We  create  an  activation  for  it,  and  start  constraining  what  can  fit  in  this  spot  by  adding 
specifications.  In  the  particular  repeated  reference  model  we  are  developing  here, 
there  are  a set  of  rules  for  adding  to  this  specification  of  the  Expression,  which  are 
derived  from  the  heuristics  described  in  figure  3b. 
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So  far,  we  have  specified  the  aspects  of  possible  co-referential  expressions 
(figure  3a)  as  a set  of  rules  for  modifying  activation  rates  within  the  Activation  Model. 
Now  we  have  to  specify  the  other  part  of  the  model,  capturing  the  aspects  of  the 
current  expression  (figure  3b). 

4,2  Kulrs  for  Sporifying  tho  Hoforrnt 

1.  Definitions  of  the  reference  words: 

The  reference  words  (pronouns  A determiners)  themselves  contribute 
specification  of  number  and/or  gender. 

For  the  example,  we  need  to  define  only  the  following  words,  so  that  these 
specifications  can  be  added  to  the  referent  oi  the  expression  that  they  are  in. 

"I"  or  "me"  or  "my"  -->  singular  and  human  and  current  speaker 

"you"  or  "your"  -->  human  and  current  hearer 

"it"  or  "this"  -->  singular 

"these"  or  "those"  -->  plural 

2.  Explicit  modifiers  in  Expressions: 

If  there  are  other  words  in  the  current  expression,  these  further  specify  the 
referent.  These  include  adjectives  and  nouns,  prepositional  phrases,  and 
subordinate  clauses. 

The  following  are  words  which  are  in  expressions  in  the  example  dialogues  and 
which  add  specifications  to  the  referent:  input,  file,  output,  line  printer,  commands, 
comma. 

3.  The  verbs  in  the  same  clause  as  the  current  expression; 

The  kind  of  action  described  by  the  verb  often  adds  further  specifications  to  the 
referent. 

These  are  the  verbs  that  occur  in  the  example  dialogue,  most  of  which  add 
specifications  to  their  arguments:  get,  keep,  xeqt,  grab,  give,  come  out,  throw 
away,  confirm,  query. 

A.  The  clause  that  the  current  expression  is  in: 

The  event  described  by  the  rest  of  the  clause  also  helps  specify  the  referent. 

The  model  then  applies  a match  process  between  the  partial  specification  of  the 
referent  of  the  current  expression  and  the  salient  concepts  in  the  Workspace,  and 
selects  the  most  salient  concept  that  best  matches  the  specification. 

Let  us  now  examine  a hand-simulated  operation  of  this  Activation  Model  on  the 
dialogue  in  figure  1 

For  all  the  instance  of  first  person  pronouns  ("I"  at  1.3,  1.8,  3.6|  "my"  at  1.16; 
"me"  at  1.25),  the  referent  will  be  specified  to  be  the  current  speaker  and  the 
expression  will  acquire  that  person  as  the  referent. 

The  second  pronouns  ("you"  at  4.8;  "your"  at  4.2)  will  all  acquire  the  current 
hearer  at  turn  4 (the  person  L)  as  their  referent  and  thus  be  co-referential  expressions. 

Let  us  focus  on  the  "it"s,  since  these  are  the  most  challenging  cases.  When  the 
"it"  at  1.11  occurs,  there  are  only  a few  activations  in  the  workspace:  oerson  L (rate 
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4.5)  and  Runoff  (rate  4).  "It"  specifies  the  referent  to  be  singular;  the  verb  specifies 
the  referent  to  be  a program.  The  only  match  is  with  Runoff,  so  that  is  acquired  as  the 
referent  concept. 

For  the  "it"  at  1.13,  the  workspace  now  contains  the  same  activations,  but  with 
different  rates:  person  L (rate  7,25)  and  Runoff  (rate  6.5).  The  specification  of  the 
referent  are  singular,  animate,  and  involved  with  input  files.  Again,  Runoff  is  acquired 
as  a referent,  making  the  expressions  at  1.3,  1.11  and  1.13  all  co-referential.  This 
agrees  with  the  annotations  of  the  observer  (figure  2). 

Things  are  a little  more  complicated  for  the  "it"s  in  turn  3,  since  there  are  more 
activations  in  the  workspace  at  that  time.  These  are  the  activations:  specific  output 
(rate  5),  specific  line  printer  (rate  4.5),  person  L (rate  3.3),  non-existent  output  (rate  2), 
the  word  "done"  (rate  1.1),  Runoff  (rate  1.9),  and  specific  input  file  (rate  0.8).  The 
specification  of  the  referent  include  singular,  movable,  valueless  to  oerson  L.  The  best 
matches  are  with  the  specific  input  and  the  specific  output,  and  since  the  specific  output 
is  more  salient,  it  would  be  acquired  as  a referent.  This  differs  slightly  from  the 
observer’s  annotation,  since  he  noted  "it"  as  referring  to  a specific  instance  of  the 
concept  referred  to  by  "the  output" 


Finally,  the  referent  of  "it"  at  3.8  will  be  specified  as  being  singular,  movable,  and 
an  entity  internal  to  computers.  The  state  of  the  Workspace  will  be  similar  to  that  for 
the  previous  "it",  with  the  Activation  rates  of  the  specific  output  higher  and  all  others 
lower.  The  "if"  would  acquire  the  specific  output  as  a referent.  At  one  level  the  two 
"it"s  in  turn  3 are  co-referential,  but  their  low  level  referents  are  different  and  at  this 
level  they  are  not  co-referential.  We  will  examine  this  issue  in  more  detail  in  section  7. 


1.  Complex  Hrprntrd  Hrferritcr 

So  far,  we  have  been  primarily  concerned  with  simple  repeated  reference,  in 
which  the  co-referential  expressions  are  simple  noun  phrases  with  relatively  concrete 
concepts  as  referents.  However,  Expressions  are  often  used  to  refer  in  much  more 
complex  ways.  Our  Dialogue  Modeling  Project  at  ISI  has  investigated  two  kinds  of 
complex  repeated  reference.  Text  Reference  and  Propositional  Reference  (Mann,  et. 
al.,  1975;  Archbold,  1975). 

People  sometimes  use  Expressions  to  refer  to  words  or  phrases  that  have  been 
previously  said.  For  example,  we  can  talk  about  the  last  sentence  or  about  this 
sentence  - two  text  references.  ,°eople  more  commonly  use  Expressions  to  refer  to 
whole  propositions  that  have  been  uttered  previously.  For  example,  I can  refer  to  the 
previous  description  of  repeated  reference  tricks  - a reference  to  a concept  that 
spans  several  pages. 

Those  two  kinds  of  references  and  their  attendant  complexities  are  discussed  at 
length  by  Archbold  (1975).  To  illustrate  some  of  the  many  levels  of  text  and 
propositional  reference,  consider  the  following  (constructed)  examples: 

1.  Four.  That  rhymes  with  score. 

2.  Four.  That  is  a four  letter  word. 

3.  Fourscore  and  seven  years  ago.  That's  eighty  seven  years  ago. 

4.  Four  score  and  seven  years  ago.  That’s  the  opening  phrase  of  a 
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famous  speech  given  in  1863. 

5.  Four  s^v-e  and  seven  years  ago.  That  was  1776. 

6.  Four  score  and  seven  years  ago.  That  was  1888. 

(See  Archbold  (1975)  for  a set  of  real  examples  that  make  this  same  point.) 

Basically,  people  are  able  to  reference  a whole  spectrum  of  concepts,  including 

those  concepts  that  represent  intermediate  results  in  the  comprehension  process. 

Among  these  are;  a particular  feature  of  the  pronunciation  of  words,  a specific  use  of 
a word,  the  word  in  general,  a phrase,  clause  or  sentence.  These  are  all  clumped  under 
the  category  of  text  reference. 

At  a deeper  semantic  level,  people  are  able  to  reference  the  concepts  that 
represent  various  levels  of  comprehension  and  understanding.  These  include:  actions, 

simple  low  level  propositions,  and  larger  scale  units  like  topics.  These  are  all  types  of 
proposition  reference,  and,  like  text  reference,  the  referents  are  intermediate  results  of 
a comprehension  prv,cess. 


5.1  Cnntfilrv  ffrprnird  f{/’frr/<nrr  find  the  Artiiniwn  Modrl 

Existing  models  or  repeated  reference  have  had  difficulty  with  these  more 
complex  kinds  of  reference.  The  few  systems  that  address  propositional  reference  at 
all  handle  it  in  an  extremely  I'mited  and  ad  hoc  way  (for  example,  Winograd's  SHRDLU), 
and  none  tackle  the  problem  of  text  reference. 

What  about  the  Activation  Model  presented  here?  For  this  model  to  find  a 
referent,  the  concept  has  to  be  in  the  Workspace.  If  we  can  get  our  language 
comprehension  processes  to  put  all  their  intermediate  results  into  this  Workspace,  then 
these  results  will  be  temporarily  available  as  possible  referents  of  a text  or 
propositional  Expression.  The  Workspace  will  contain  the  whole  spectrum  of  currently 
active  concepts,  all  of  which  will  fade  away  if  not  referenced  again  soon. 

This  way  of  expanding  the  scope  of  the  activation  model  fits  very  naturally  within 
a recently  proposed  general  framework  for  process  models  (Levin,  1975).  In  this 
framework,  called  Proteus,  all  processing,  at  all  levels,  takes  place  within  such  a 
Workspace.  However,  even  with  more  conventional  language  comprehension  models, 
copies  of  intermediate  results  can  be  added  to  the  Workspace,  thus  broading  the  scope 
of  the  referential  processes. 


5.2  Quotr.% 

Quotes  generally  play  a big  role  in  much  of  the  philosophical  discussion  of  text 
reference.  They  are  generally  interpreted  as  signifying  that  the  word  itself  is  meant, 
rather  than  the  underlying  concept  for  which  the  word  is  the  name. 

However,  in  real  dialogue,  quotes  (which  occur  rarely)  are  used  in  a broad  variety 
of  ways.  (See  the  discussion  of  many  of  these  by  Archbold  (1975).)  One  reaction  to 
the  hodge-podge  of  actual  usage  of  quotes  is  to  retreat  to  the  performance/competence 
distinctions  of  linguists.  However,  once  we  take  the  actual  uses  seriously,  we  can  see 
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that  quotes  are  used  as  a "warning  signal"  to  the  hearer  that  the  interpretation  of  the 
quoted  word  or  phrase  is  meant  to  be  different  from  the  usual  interpretation.  Now,  as 
we  have  seen  in  our  examination  of  text  and  propositional  reference,  there  are  concepts 
at  many  different  levels  in  the  comprehension  of  a word  or  phrase  that  can  be  referred 
to.  So,  as  a first  approximation,  we  can  extend  the  Activation  Model  of  Reference  to 
deal  with  quotes  by  suppressing  the  initial  interpretation  of  a quoted  word  or  phrase, 
thus  allowing  some  less  salient  concept  to  be  selected  as  the  referent.  This  hypothesis 
for  modeling  quotes  thus  approximates  the  wide  variety  of  way^  in  which  quotes  are 
actually  used. 


6.  Non-RrfU'ntrd  Rrfrrr.nrr 

People  often  use  Expressions  to  introduce  concepts  not  discussed  previously. 
One  way  in  which  we  do  this  is  to  refer  to  concepts  not  explicitly  mentioned,  but  which 
are  closely  related  to  those  that  were.  To  account  for  ihis  use  of  Expressions,  Chafe 
(1972)  introduced  the  notion  of  "foregrounding",  in  which  the  mention  Of  a concept  made 
closely  related  concepts  available  for  referencing. 

A classic  (constructed)  example  of  this  is:  "I  rode  a train  today.  I was  allowed  to 
toot  the  whistle".  The  phrase  "the  whistle"  is  a foreground  reference.  We  don’t  just 
want  to  fall  back  on  our  default  action  of  using  the  specification  of  the  referent  ”'f  the 
current  expression  (that  this  is  some  whistle  that  can  be  tooted).  Instead,  we  are  able 
to  further  determine  that  this  is  a specific  whistle,  which  is  controlled  from  the  engine 
cab,  etc.  The  first  sentence  "foregrounded"  the  knowledge  about  trains,  so  that  the 
reference  to  "whistle"  in  the  second  can  be  determined  to  mean  a very  particular  kind 
of  whistle. 

There  are  two  systems  which  allow  kinds  of  foreground  .eferences,  Rieger’s 
inference  component  of  the  MARGIE  system  (Rieger,  197A),  and  the  SAM  system  (Schank, 
1975). 


In  the  MARGIE  system,  Expressions  were  detected  by  the  parser  and  passed  to 
the  inferencing  system  for  resolution.  The  set  of  possible  referents  considered  by  the 
inference  system  included  not  only  those  directly  derived  from  previous  utterances,  but 
also  those  derived  from  any  inference  made  from  these  utterances.  In  this  way,  the 
"inferred"  concepts  were  "foregrounded"  and  thus  available  for  referencing. 

The  SAM  System  (Schank,  1975)  is  an  implementation  using  Scripts  (Schank  & 
Abelson,  1975)  as  a high  level  organization  for  language  understanding  . 

in  comprehending  a particular  set  of  utterances,  a Script  is  found  and  used  to 
guide  further  comprehension.  Since  a Script  is  an  organized  body  of  knowledge,  the 
system  can  use  it  to  generate  expectations  of  future  utterances.  More  interesting  for 
this  discussion,  it  provides  a set  of  "closely  related"  concepts,  all  available  for 
foreground  reference. 
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6.1  Foreground  Reference  and  the  /Iclivniion  Model 

The  Activation  Model  of  Reference  can  be  extended  to  deal  with  foreground 
reference  in  the  same  way  as  we  extended  it  to  deal  with  complex  repeated  reference. 
We  put  all  the  intermediate  results  of  our  inferencing  and  comprehension  processes  into 
the  Workspace.  For  example,  if  we  use  higher  level  organizing  concepts  in 
comprehension,  then  whenever  these  concepts  are  used  in  comprehending  an  input, 
they  will  be  put  into  the  Workspace,  and  thus  all  the  components  will  be  available  for 
referencing. 

This  proposed  solution  illustrates  (and  utilizes)  the  dependence  of  reference 
processes  on  other  comprehension  processes.  Whenever  some  new  comprehension 
process  is  developed,  the  capabilities  of  ihe  referencing  processes  will  also  be 
expanded  if  the  "results"  of  the  new  processes  are  added  to  the  Workspace. 

6.2  Failure  to  Find  a Referent 

What  if  we  shll  don’t  find  a referent  concept?  We  already  have  a partial 
specification  of  the  referent,  and  in  many  cases,  this  is  all  we  need  to  know  about  this 
concept.  Many  Expressions  seem  to  need  no  definite  referent  at  a"  for  comprehension 
to  proceed  satisfactorily.  These  cases  are  explored  by  Martin  (1975)  in  some  detail.  In 
considering  these  cases,  he  developed  a generalization  of  the  notion  of  a simple  pointer 
to  a concept  as  the  "referent"  of  an  expression.  These  are  "descriptions"  with  varying 
degrees  of  detail.  These  descriptions,  which  are  partially  specified  referents,  can  be 
utilized  in  performing  inferences,  and  also  stored  as  knowledge  known  about  the 
concept. 
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7.  Heferent.i  and  Reference 

So  far  in  this  paper,  the  term  "referent"  has  been  used  128  times.  Let  us  now 
examine  explicitly  how  this  term  is  being  used,  and  therefore,  what  position  on 
reference  has  been  implicitly  assumed. 

We  have  talked  about  Expressions  as  sets  of  words  which  have  concepts  as  their 
referents.  One  might  be  tempted  to  claim  that  the  referent  concept  for  an  expression 
is  tiie  "moaning"  of  that  expression.  However,  we  have  seen  several  ways  in  which  this 
has  to  be  modified.  First  of  all,  the  particular  concept  which  is  the  referent  of  an 
expression  depends  on  the  current  context  - the  same  expression  can  have  two 
different  referents  in  two  different  contexts.  Given  this  observation,  one  might  be 
tempted  to  view  the  referent  of  an  expression  as  entirely  context-dependent,  and 
therefore  that  expressior.-  by  themselves  have  "no  me.ming". 

However,  our  further  investigation  of  complex  reference  and  non-repeated 
reference  have  led  us  to  a pos'^'ble  reconciliation  of  these  two  views  of  reference. 
Instead  of  a single  referent  concept,  we  found  that  an  expression  has  a whole  family  of 
referents  at  many  different  levels.  As  the  comprehension  of  a given  expression 
proceeds,  intermediate  interpretations  of  the  expression  are  generated,  at  successively 
more  abstract  levels.  The  initial  referents  represent  the  surface  characteristics  of  the 
expression  - the  set  of  component  letters,  the  shape  or  sound  of  the  words. 
Successive  levels  include  the  low  level  semantic  specifications  - number,  gender,  the 
superset  conce|:ts. 
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The  lower  level  referents  of  a given  expression  will  be  the  same  across  context, 
while  the  more  abstract  referents  will  differ  from  context  to  context. 

Now,  we  can  consider  two  expressions  to  be  co-referential  whenever  they  have 
identical  referent  concepts  at  some  level.  That  Is,  if  expressions  El  in  context  Cl  has 

referents  (R1 1,  R12,  R13 RIN)  and  expression  E2  in  context  C2  has  referents  (R21, 

R22,  R13,  ...  , RIN),  then  the  two  expressions  are  co-referential  because  they  have  the 
same  referent  concepts  at  level  three  and  beyond. 

For  example,  "Me.'ina  del  Rey"  and  “Marina  dnl  Ray"  are  co-referential  a a fairly 
low  level;  "La  Jolla",  "the  place  where  UCSD  is  located",  and  "The  Jewel  of  the  Pacific" 
are  co-referential  at  a slightly  higher  level,  and  "it"  and  "Runoff"  are  co-referential  at  a 
higher  level  in  the  utterance  "How  can  I get  Runoff  to  work?  I keep  xeqtn  it  ...". 

Co-referential  expressions  are  thus  "the  same"  above  a certain  level,  but 
different  below  that  level.  This  difference  is  the  reason  why  statements  like  "La  Jolla  is 
The  Jewel  of  the  Pacific"  aren’t  empty  tautologies  ( X is  X ). 

A referent  concept  is  then  one  of  the  family  of  concepts  that  represent  an 
expression  at  some  level. 


fl.  Summary 

In  this  paper,  we  have  examined  a number  of  the  repeated  reference  heuristics 
used  by  language  understanding  systems.  After  observing  the  difficulties  that  existing 
models  have  with  repeated  reference  in  real  dialogues,  we  divided  these  heuristics  into 
two  categories,  those  dealing  with  aspects  of  the  possible  co-referential  expressions, 
and  those  dealing  with  aspects  of  the  current  expressions.  Working  from  this 
categorization,  we  were  able  to  propose  an  Activation  Model  for  simple  repeated 
reference,  and  then  to  extend  it  to  text  and  propositional  repeated  reference,  and 
finally  to  non-repeated  reference.  Some  general  issues  of  reference  were  examined  in 
light  of  the  Activation  Model. 
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REPEATED  TEXT  AND  PROPOSITIONAL  REFERENCE;  CONCEPTS  AND  DETECTION 
INTRODUCTION 

I.  INTRODUCTION 


When  people  engage  in  dialogue,  they  quite  frequently,  at  some  point,  refer  back  to 
and  talk  about  something  that  has  already  been  said  in  the  dialogue,  or  refer  forwards  to 
something  that  is  about  to  be  said.  In  some  cases,  they  may  refer  to  a string  of  words 
uttered  at  some  nearby  point  in  the  dialogue,  thereby  making  a "(rrprnird)  irxi  rrfrrrnrc". 
in  other  cases,  they  may  talk  about  some  state  of  affairs,  sorrie  statement,  some  belief, 
which  is  described,  made,  expressed  or  referred  to  nearby  in  the  dialogue,  thereby  making 
a "(rrprated)  propositional  roferenro". 


Both  text  reference  and  repeated  propositional  reference  are  of  interest  to  the 
Dialogue  Analysis  Project.  Two  questions  immediately  arise  concerning  these  dialo.gue 
phenomena,  however.  First  of  all,  there  is  the  conceptual  problem:  (a)  is  it  possible  lo 
define  these  phenomena  at  all  clearly?  (b)  if  there  are  several  different  feasible  definitions 
available,  which  should  the  Dialogue  Analysis  team  choose,  given  its  goals  and  interests? 
Secondly,  there  is  the  detection  problem.  Given  a definition  of  these  tv/o  phenomena,  hov/ 
can  one  determine  what  expressions  may  be  involved  in  them,  and  which  particular 
expressions  are  involved  in  a particular  instance  of  thorn? 

The  present  paper  addresses  these  two  problems.  The  first  section  will  survey 
some  philosophical  and  linguistic  literature’s  treatment  of  the  notions  of  "text"  and 
"proposition"  and  of  problems  associat  J with  these  notions.  The  available  notions  will 
then  be  discussed  and  evaluated  in  the  light  of  the  team’s  interests  and  goals,  and  in  vio’.v 
of  some  of  the  data  that  the  team  will  have  to  account  for.  The  second  section  v/ill 
discuss  various  "clues"  v;hich  might  be  relied  upon  to  deterraine  whether  a given 
expression  is  being  used  in  a dialogue  to  make  a text  reference  or  a repeated 
propositional  reference.  The  focus  will  be  upon  the  question:  are  there  any 
operaticnalizable  procedures  for  detecting  pairs  of  expressions  which  are  involved  in 
repeated  text  Or  propositional  reference? 


propositional  REFERENCE:  CONCEPTS  AND  ULItCT|ON 
THE  NOTIONS  OF  TEXT  AND  REPEATED  PROPOSITIONAL  REFERENCE 


SECTION  I: 

CONCEPTS  OF 
text  REFERENCE 
AND 

REPEATED  PROPOSITIONAL  REFERENCE 


In  this  section  the  concepts  of  text  reference  and  of  repeated  propositional  reference  will 
be  examined,  first  as  they  are  presented  in  some  philosophical  and  linsuistic  tradilions.  ,irri 
then  in  view  of  their  operationalizability  for  the  team. 


REXPR 


TREE 

PREF 

RREF 


RPREF 


Below,  we  shall  use  the  following  terminology  and  abbreviations: 

a referring  expression,  i.e.  an  expression  which  is 
used  by  a speaker  in  a dialogue  to  refer  to,  mention, 
or  pick  out  an  object  or  a set  of  "objects"  (where 
object  is  taken  in  its  widest  possible  sense,  to 
include  physical  objects,  people,  states  of  affairs, 
events,  actions,  processes,  abstract  constructs  such 
as  the  numbers  or  the  quality  of  beauty,  intentional 
objects  such  as  thoughts  or  beliefs,  hypothetical 
or  fictional  objects  such  as  the  child  X and  Y would 
have  conceived  together  had  they  not  separated,  or 
Pegasus,  and  so  on), 
text  reference 
propositional  reference 

repeated  reference:  the  use  of  two  separate  linguistic 
expressions  to  refer  to  (designate,  mention,  pick  out, 
etc.)  the  same  "object".  The  first  referring  expression 
involved  we  call  "the  anlecedent  referring  expressioii" 
(AREXPR);  the  later  of  the  two  expressions  involved  we 
call  "the  consequent  referring  expression"  (CREXPR). 
repeated  propositional  reference. 


1.1  TEXT  REFERENCE 


1.1.1  NOTIONS  OF  TEXT  REFERENCE  IN  THE  LITERATURE 

In  the  literature,  the  notion  of  text  reference  is  closely  bound  to  the  distinclion 
between  the  use  of  language  ond  Ihe  monlioning  of  language.  Wc  nrclinanly  nsr  icing.u.i  '.u 
to  talk  about  the  world  ; but  v.'c  may  also  use  language  lo  talk  about  language,  to  discuss 
linguistic  expressions  - words,  phrases,  sentences,  or  y.'hole  texts  or  discourses.  In  the 
latter  case,  we  nwntimi  or  refer  to  linguistic  expressions  in  order  to  say  something  about 
their  phonetic,  orthographic,  syntactic,  semantic,  logical  or  pragmatic  features.  [1] 
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Traditionally  it  was  thought  that  the  use  and  mention  of  expressions  were  very 
no^nlensraY;  distinction  was  te  run  the  risk  of  creating  cuch 


or 


My  dog  contains  three  letters. 
My  "dog"  is  a hearty  eater. 


Though  it  was  allowed  that  one  could  inention  expressions  by  the  use  of  oroner 

" H L ">»  Of  to.t  "olero  0 LTZ\ 

reforooce  by  oootaboo.  A ouolaMon-expreoo.on  - I.e.  a qootod  o.preaaioo  ploa  Uo 
quotnt?s.'^"  expression  within  the 


There  is  an  important  ambiguity  in  the  notion  of  mentioning  expression-  Te-t 
.oforcnco  is  not  reference  only  to  individual  sounds  or  inscriptions.  When  we  refer  to 
linguistic  expressions,  we  may  refer  cither  to  tokens  or  to  types.  7’o/.m;„  arc  particular 

'uch' thm^a"/  ° t -^ot  single  homogeneous  entities  (there  is  no’ 

-uch  hin„  a.  a type-word),  but  are  rather  classes  of  individual  tokens,  grouped  tooctlior 
by  reference  to  some  set  of  taxonomic  criteria  [2],  which  can  be  referred  to  by  the  use  of 

ftn  '•"'"Suage  can  be  used  to  talk  about  natural  language  expressions 

to  t.iK  .bout  Itself"  ,n  som,e  sense)  leads  to  well  known  antinomies.  Con;ider  the  i e 
sen  ence.  The  third  sentence  of  footnote  1 of  this  paper  is  false.  Is  the  precedm- 
.entence  true  or  false  or  both  or  neither?  In  order  to  avoid  such  paradoxes,  looi-ians  who 

diTn.  h H '-'h  c;ndrtion"ha!fe 

d .hn^uished  between  the  language  they  examine  - the  object-language  - and  the 

^uage  they  use  >n  their  analyses  - the  meta-language.  Cf.  Alfred  Tarski  "Th- 
Semantic  Conception  of  Truth",  in  Leonard  Linsky,  ed..  "Semantics  and  the  Philosophy  of 
Language  , University  of  Illinois  Press,  Chicago,  1952.  ^ ^ 

[2]  Two  remarks  are  in  order  here.  First,  we  may  very  well  be  unable  to  expliciti  ^ 
specify  some  or  most  of  the  criteria  which  we  use  to  group  tokens  into  classes,  i.e.  to 

thina  fhaT  r t Secondly,  it  is  probably  wrong  to 

' o Je  sort  tokens  into  types  by  reference  only  to  orthographic  or  phonic  critr  na 
without  any  consideration  of  their  semantic  role  What  we  ,-efer  to  when  we  perform  a 
text  reference  to  a token  is  a sound  or  a mark  which  belongs  to  a languaoc,  which  was 
de-mbma  ''"Suistic  purpose,  or  was  produced  in  a context  in  which 

e!nn  n relative  to  a framework  in  which  such 

event,  or  marks  can  be  systematically  described,  is  an  appropriate  activity  to  en^a^e  in 

unon'Ih  °^‘^°Sraphic  differences  between  tokens  which  would  have  no  irnpact 

the  Idenmv  r"p  '"'T'"  — "Systematically  Ignored  Differences  and 

the  Identity  of  Propositions  , roundations  of  Language,  Vol  12,  No.  1,  September  197A 
oectiOn  Z.  ’ 


I 

i 


::s=r;r::ro;:t'  ;:,r  n r " 

by  means  of  ostension  or  definite  de^criotion  1 n ^ ° ‘bem  out 

name.  For  example,  the  mark  on  tL  nc^  i;!"’  ' P''°P- 

animal 

toKonsi  opccics-words^ou^d  b^t^rm^d 'brcpoir^'^|I°  '°  individual 

between  the  letters.  ^ -Pollins  the  token  backwards,  inserting  dots 

Thus  if  the  tv.-o  marks  on  the  next  line 
animal  towards 

r,r;;"r:r;,r  s ';r  

distinction  between  tokens  and  types  one  musT'''^'’  f "-.e 

quotation-marks,  and  separate  Out  those  ca-e- in  wh  T ^ ?T  " 

a token,  and  when  they  form  the  name  of  a Won  r form  the  name  of 

token  belongs.  ° which  the  individual  quoted  expression 

Let  us  consider  some  examoles  of  TRFF  ti 

concocted  for  purposes  of  illustration  - invJv.  / below  - all  partly  or  wholly 

refer  to  the  same  text;  such  repeated  r^f^^lnce^^^^  T":  expressions  which 

being  performed.  -m>ply  designed  to  highlight  the  TREFs 

(1)  A;  The  third  letter  from  the  top  of  the  list  on  the 

blackboard  is  "D". 

0;  Yes,  I know.^  Its  v/ritten  in  red  chalk.  But 

Uan't  make  out  the  letter  directly  below 

(2)  A;  John  Smith  ,s  calling  himself  "Hiroto  Texagewish" 

these  days. 

0:  His  assumed  name  is  cerlainly  hard  to  pronounce! 

(3)  A.  On  checMi.,  G/s  j 

di.cible  all  jets  on  tv/o  adjacent  quads"  Is 

R.  Wn  'm-  '^"^'"2  '■’bout? 

0.  No,  disable  all  jets  on  two  adjacent  quads"  is  not 
what  I was  referring  to. 

B-  "’ASSWORD).  job  NUJ/OFR" 

TO  LS"  'r 

(5)  A:  John  saw  „e  a =ood  exampio  of  , lonaue-l^islej 

yesterday.  What  was  it  now?  Ah  yes  I 
rememnber.  How  much  wood  would  a wood-chuck 

B-  Well  ‘h  I " "^°°rJ-chuck  would  chuck  wood'' 

0.  Well,  .he  tongue-twister  he  proposed  was  earner 

to  pronounce  than  the  one  Mary  came  up  with. 

[3]  L.  Goddard  and  R.  Roiitlpv  "i  im  » 

Philosophy,  Vol.  44,  No.  1,  Ma"i  1966.  Quotation",  The  Australasian  Journal  of 
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(6)  A:  You’re  coming  in  a little  louder  now. 

B:  Fred,  you’ll  have  to  say  that  again.  I couldn't 

hear  you;  there’s  too  much  background  noise. 
A;  Okay.  I said,  you’re  coming  in  a little  jouder 
now. 


Consideration  of  the  above  examples  leads  to  the  following  remarks: 

(1)  The  linguistic  expressions  which  are  used  to  refer  to  (mention)  other  linguistic 

expressions  are  of  two  types.  The  first  type  trxhihits  or  rci)licat(!s  the 
expression  to  which  it  refers.  Thus  in  the  two  following  sentences 
John  wrote  "Today’s  lesson  is  Chapter  V"  on  the  board. 

Should  I type  TTY  or  TTY;? 

the  underlined  expressions  are  used  to  refer  to  expressions  which  they 
replicate  or  exhibit.  The  difference  between  exhibiting  and  replicating  can  be 
shown  by  the  following  examples: 

The  following  letter,  “D",  was  not  written  by  hand. 

(exhibition) 

Yesterday,  John  wrote  "Today’s  lesson  is  Chapter  V" 
on  the  blackboard. 

(replication) 

In  the  first  case,  one  is  exhibiting  a token  and  talking  about  that  very  token, 
and  no  other.  In  the  second  case,  one  is  talking  about  a token  John  produced 
yesterday,  and  one  is  doing  so  by  exhibiting  a token  of  the  same  type  as  the 
token  he  produced  yesterday;  in  that  sense  one  is  replicating  a token,  by 
producing  and  exhibiting  a "copy",  a token  of  the  same  type. 

The  second  type  of  linguistic  expression  v/hlch  is  used  for  TREF  neither 
exhibits  nor  replicates  the  expression  to  v/hich  it  refers.  An  example  of  such 
an  expression  would  be  "The  sentence  John  wrote  on  the  board  yesterday". 

(2)  Exhibiting  or  replicating  text  referential  expressions  may  or  may  not  use 

quotation  marks:  compare  the  examples  (1-3)  and  (^-6).  This  fact  flies  in  tl',e 
face  of  the  traditional  view  according  to  which  one  only  exhibits  or  replicates 
expressions  to  which  one  wants  to  refer  by  using  quotation-marks, 
quotation-marks  being  seen  as  operators  which  form  a proper  name  of  the 
string  enclosed  within  them.  One  can  exhibit  or  replicate  a linguistic 
expression  not  only  by  quotation,  but  also  by  capitalization,  pauses, 
indentation  or  spacing,  intonation  and  many  other  v/ays  besides.  There  thus 
seems  to  be  no  one  reliable  orthographic  sign  of  exhibition  or  replication; 
there  may,  however,  be  a list  of  such  signs.  In  any  case,  the  function 
traditionally  assigned  to  quotation  marks  can  be  performed  by  ir,any  other 
marks  or  sounds.  (One  should  also  note  that  quotation  marks  are  sometimes 
used  in  complex  ways  which  bear  only  a faint  resemblance  to  their 
stereotypical  use,  as  when  they  are  employed  in  ironical  remarks,  as  shudder 
quotes  or  snigger  quotes.  Exam, pies:  "These  days,  one  never  gets  price 
stability,  only  decreases  in  the  rate  of  increase  of  inflation.  That’s 
"progress".",  or  Lenin’s  statement  "We  v^ill  "support"  the  Mencheviks  as  the 
rope  supports  the  neck  of  the  hanged  man!") 
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(3)  Text  referential  exprcooions  which  are  not  of  the  "exhibition"  or  "re-plicatioii" 
types  may  be  (a)  pronouns  or  (b)  noun  phrases.  The  noun  phrases  may  be 
proper  nouns  (think  of  referring  to  a string  of  marks  on  a piece  of  paper  as 
"Tom"),  (modified)  common  nouns  ("a  password",  "the  loud  shout")  v.'ith  or 
without  relative  clauses. 

(A)  Text  referential  expressions  which  are  not  of  the  "exhibition"  or  "replication" 
type  may  be  combined  with  those  that  are  to  form  complicated  text  referring 
expressions.  Consider  example  (4),  where  "(PASSWORD),  job  number"  arc 
used  alongside  two  replication  text  expressions.  It  is  interesting  to  note  th.'t 
the  order  in  which  these  referring  expressions  occur  is  the  same  as  the  order 
in  which  the  referents  ci  each  individual  expression  must  occur  in  order  to 
form  a token  of  the  type  which  the  entire  string  "ATT  LINKER  (PASSWORD), 
job  number"  denotes. 


One  might  conclude  that  text  reference  is  a /n/re  mention  of  phonetic  or  orthogfophic 
tokens  or  classes  of  tokens.  However,  the  traditional  clearcut  distinction  between  use  and 
mention  has  been  attacked  of  late  - and  justifiably  so.  Things  are  not  so  sircple. 

'There  are  ...  many  sentences  in  which  an  expression  is  both 
introduced  and  is  also  used;  in  particular,  sentences  which  are 
used  to  convey  both  linguistic  and  factual  information.  This  is 
especially  so  in  sentences  containing  the  words  'call’,  'distinguish', 

’determine’,  'is  called’,  ...  'satisfy’,  or  compounds  of  such  'words. 

Consider,  e.g.  'That  sleek  red-coated  dog  is  Rover’,  'The  "Queen 
Elizabeth",  which  is  so-named  (so-cal'ed)  after  the  present  Queen 
Mother,  sailed  for  Southampton  yesterday’,  'What  is  .lalva'f’,  'Call 
her  a shrew’. 


'If  triangles  are  taken  as  three-sided  figures  then  they 
have...’,  and  also  indirect  speech  forms  in  which  the  speakers 
actual  words  are  reported."[4] 

There  arc  sentences  containing  text  references  in  which  the  text  refered  to  rriust  not  only 
be  considered  as  an  uninterpreted  orthographic  object,  but  also  be  "read  v/ith 
understanding",  i.e.  interpreted.  Some  examples  of  such  sentences  are: 

(1)  The  sign  says,  "George  Vi/ashington  slept  here",  but 

I don’t  believe  he  ever  did. 

(2)  Whenever  Fred  sighs  "Boy,  do  I need  a drink",  he  expects 

you  to  fix  hirri  one. 

(3)  What  he  actually  said  was  "It’s  clear  that  you’ve  given 

this  problem  a great  deal  of  thought",  but  he  meant 
quite  the  opposite. 

(4)  "I  talk  better  English  than  both  of  youse",  shouted 

Charles,  thereby  convincing  me  that  he  didn't. 

In  all  of  the  above,  the  presence  of  pronomlnalization,  ellipsis  or  semantic  anaphora 

[4]  L.  Goddard  and  R.  Routley,  op.cit.,  pg.  22. 
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involving  terms  both  within  and  outside  of  the  quotation  marks  shows  that  the  quoted 
sentences  must  have  been  interpreted.[5]  Thus  if  vre  say  that  text  reference  involves  the 
mention  of  a linguistic  expression,  me  must  allow  that  such  mciuioii  may  he  ar.romfminod 
hy  use,  and  ihaL  therefore  uie  must  eount  a$  text  reference  not  only  reference  to  loUens  or 
tyfics  alone,  hut  also  reference  to  tokens  or  types  paired  with  their  meanin/r,  some 
interpretation,  the  statement  they  were  used  to  make  on  some  occasion,  or  the  proposition 
they  express. 


1.1.2  THE  TEAM’S  INTEREST  IN  TEXT  REFERENCE. 


In  light  of  the  concepts  introduced  above,  let  us  now  consider  the  notion  of  text 
reference  which  the  team  is  interested  in. 

In  Observation  Methods  for  Human  Dialogue",  we  find  the  following  explanations  of 
the  concept: 


...a  so-called  'Text  Reference’,  made  to  a string  of  words  in  the 
preceding  dialogue  itself  (and  not  to  the  referent  of  that 
preceding  string  of  words!)."  (pg.  22) 

A Text  Reference  occurs  whenever  reference  is  made  to 
previously  occurring  words  within  the  transcript.  For  example, 
in  the  sentence  ’Go  3 blocks  and  turn  North;  by  North  I mean 
towards  the  mountains.’  the  second  use  [i.e.  occurrence  - A.A.] 
of  "North"  is  a Text  Reference  to  the  first.  We  call  this  a Text 
Reference  because  it  refers  to  the  previous  usr-  [occurence]  of 
the  word  itself,  rather  than  to  its  meaning."  (pgs  26-27) 

These  passages  indicate  that  Ihe  team  is  interested  only  in  singular  text  references  to 
words  previously  used  in  the  dialogue.  Note  that  if  we  interpret  these  indications  strictly, 
we  rriust  conclude  that  only  TREF  to  previously  used  tokens  are  to  be  focused  upon  - since 
word-tokens,  and  not  classes  of  word-tokens,  are  used  at  o particular  tirrie  and  place  by 
participants  in  a dialogue.  On  this  strict  inlerpretation,  the  example  given,  viz. 

Go  3 blocks  and  turn  North; 

by  North  1 mean  towards  the  mountains. 

is  of  interest  only  if  one  reads  the  second  sentence  as  equivalent  to  'I  intended  to  use  tlie 
token  of  type  North'  which  occured  in  my  previous  sentence  to  mean  towards  the 
mountains.  It  would  not  be  of  interest  if  the  second  sentence  were  read  as  equivalent  to 
'All  tokens  of  type  "North"  which  I utter  1 use  to  mean  towards  the  mountains’,  for  in  that 
case  the  second  occurence  of  "North"  would  refer  not  to  the  previous  token  but  to  a class 
of  tokens  of  which  the  previous  token  is  a member. 

However,  examination  of  the  examples  given  m "Observation  Methods  in  Human 
Dialogue"  and  discussion  with  team  members  indicate  that  they  are  concerned  with  a much 
wider  variety  of  phenomena.  What  they  are  concerned  with  and  what  they  wish  to  refer 

[5]  Barbara  Hall  Partee,  "The  Syntax  and  Semantics  of  Quotation",  in  S.R.Anderson  and  Paul 
Kiparsky,  "A  Festschrift  for  Morris  Halle",  Holt,  Rinehart  and  Winston,  New  York,  1973. 
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to  by  the  term  "Text  lipforonro"  ooniisui  of: 

(1)  Siiifrulnr  irxi  rrfrrcnnr.  whoro  nil  I’v/iri’.wioii  i.<  usrrl  to  refer  to  either  (a)  n token 

whie.h  ix  used  either  before  or  ofter  it  in  the  tlinlonne,  nr  (h)  a elnsx  of  tokenx 
of  which  our  or  iiiore  tokenx  uxed  in  the  dinloRue  ore  niemhers. 

(2)  lieiieoted  text  referenee.  where  the  /UiEXPli  and  the  Clil'.SPli  refer  to  either  (n  > 

the  xntnr  token,  or  (h)  the  xutne  />•/)<?,  or  (e)  a ty'e  and  n token  which  is  a 
ineinher  of  that  ty/ie,  or,  lastly,  (d)  two  token-elnsxes  n.e,  iyiiex\  one  ol 
whieh  ix  a xnhxet  of  the  other.  Either  or  both  of  the  referents  of  the 
/lEEXPE  and  the  CKEXPE  may  either  he  or  eontain  as  a ineinher  a token 
uxed  in  the  dinlo^ue. 


We  may  repeat  here  what  we  have  strecsed  above;  if  an  expresoion  El  ii  uoed  in  n 
dialogue,  and  is  elsewhere  referred  to  by  means  of  another  expression  E2,  E2  is  a 
text-referring  expression  which  is  involved  in  a singular  text  reference  (in  the  sen  .r.  of 
(1))  if  it  is  used  to  refer  to  El  as  a token,  i.e.  if  it  is  uoed  to  refer  to  some  in^criptional 
or  orthographic  features  of  El.  However,  E2  may  also  (at  the  same  time)  be  used  to  refer 
to  the  moaning,  the  referent,  or  some  other  non-textual  feature  of  El.  Text  reference 
does  not  exclude  interpretation. 
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1.2.1  PROPOSITIONS  AND  PROPOSITIONAL  REFERENCE 


Whoreas  the  notion  of  "text”  is  relatively  unproblematical  and  unconfusins,  given 
certain  simple  distinctions,  the  notion  of  “proposition”  is  a difficult  one  to  formulate  clearly. 
Unlike  "text",  "proposition"  is  very  much  a technical  philosophical  term.  We  find  three 
main  concepts  of  what  a proposition  is,  formulated  and  used  by  (1)  linguists,  (2)  logicians 
or  philosophers  primarily  concerned  with  logical  matters,  and  (3>  by  speech-act  theorists. 

1.2. 1.1  Those  linguists  who  make  use  of  the  term  "proposition"  often  equate  it  v/ith 
the  wrnniiifr,  rondiup;  or  st'iiiniitir.  inif^r/irrinlioii  of  n .triiinirn  . Let  us  consider  for 
example  the  Katzian  tradition  of  transformational  semantics.  In  this  tradition,  the  meaning 
or  semantic  interpretation  of  a sentence  is  a set  of  sets  of  structured  markers  which  are 
assigned  to  the  sentence  on  the  basis  of  (a)  the  semantic  markers  assigned  to  the 
component  words  by  a dictionary,  (b)  the  syntactic  structure  of  the  sentence,  and  (c)  a set 
of  semantic  "projection  rules"  The  semantic  interpretation  of  a sentence  is  a theoretical 
construct  which  is  adequate  if  In  conjunction  with  the  rules  of  a semantic  theory  can 
predict  the  semantic  properties  of  sentences  (such  as  synonymy,  ambiguity,  redundancy, 
presupposition,  entailment,  and  so  on).  Each  set  of  structured  markings  is  a "reading"  or  a 
"proposition"  If  a sentence  is  assigned  several  sets  of  structured  markings  i!  is 
ambiguous  and  is  said  (according  to  the  semantic  theory  which  assigns  the  markings)  to 
cx/irrss  ^ovornl  diff<Triit  propositions  . 

"Sentences  are  frequently  ambiguous,  that  is,  they  express  more 
than  one  sense.  Thus,  v^e  shall  frequently  say  that  an  n-v/ay 
ambiguous  sentence  expresses  n distinct  propositions.  (We  have 
lakon  the  term  ’meaning’  to  refer  to  the  sum  of  the  propositions 
expressed  by  a sentence  ...).  We  also  understand  'proposition’  to 
convey  what  synonymous  sentences  have  in  common  by  virtue  of 
which  they  are  synonymous.  Sentences  that  are  synonymous  on 
a sense  [i.e.  on  one  of  their  readings  - A. A.]  are  thus  said  to 
express  the  same  proposition,  and  fully  synonyrrious  sentences 
are  said  to  express  the  same  set  of  propositions.  Semantically 
anomalous  sentences  express  no  proposition  at  all. "[6] 

Note  that  if  one  regards  propositions  as  the  'readings’  o sentences,  one  ?llov/s  that  not 
only  declarative,  but  also  interrogative,  imperative  and  hortatory  sentences  express 
propositions. 

1.2. 1.2  The  logicians’  notion  of  proposition  is  distinct  from  the  linguistic  notion,  at 
least  prima  facie.  Logicians  are  concerned  with  formal  constraints  on  inference,  and  with 
the  notions  of  truth  and  falsity  which  are  needed  to  account  for  formal  validity  of 
arguments.  They  have  thus  made  use  of  a concept  of  "proposition"  which  is  tantamount  to 
the  notion  of  a "iruilt-vrlticlo"  : 

"On  the  logical  account  of  propositions,  propositions  have  been  regarded,  first 
and  foremost,  as  truth-vehicles.  That  is,  propositions  are  taken  io  be  either 
the  things  or  some  of  the  things  which  are  true  or  false.  There  are  several 


[6]  Jerrold  J.  Katz,  "Sernantic  Theory",  Harper  and  Row,  Now  York,  1972,  pg.  120. 
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motivations  for  ihis  view  of  propositions.  One  historically  iruportant  motive 
has  been  to  provide  a subject  matter  for  logic,  something  for  logic  to  bo 
about.'  Logic  is,  in  the  first  instance,  the  study  of  inferences.  Whether  an 
inference  is  valid  or  invalid  depends  neither  on  the  particular  subject  of 
discourse  nor  on  the  determinate  mode  of  linguistic  expression  but  solely 
upon  the  formal  relations  between  premises  and  conclusion.  Propositions 
may  thus  be  conceived  as  sorts  of  entitles  which  stand  necessarily  in  such 
relations  as  entailrnent  and  contradiction,  and  it  is  these  relations  which 
constitute  the  grounds  of  va.id  and  invalid  inference  and  are  reflected  in 
particular  linguistic  embodiments.  ... 

A second  motivation  for  the  logical  account  of  propositions  is  found  in 
the  classical  correspondence  theory  of  truth.  On  this  view,  truth  is  regarded 
as  a relation  between  what  is  the  case  in  the  world,  the  facts,  and  the  thing, 
whatever  it  is,  which  is  true.  Propositions  have  traditionally  been  cast  in  the 
role  of  ti.e  second  term  of  this  relation."  [7] 

Thus  on  the  logicians’  view,  as  on  the  linguists’,  there  is  a distinction  betvreen 
sentences  - i.e.  strings  of  inscriptions  or  sounds  which  belong  to  sorne  language  - and  the 
propositions  which  sentences  express.  Thus  the  followl.ng  distinct  sentences  (taken  either 
as  tokens  or  types) 

(1)  The  moon  is  smaller  than  the  sun. 

(2)  The  sun  is  larger  than  the  moon. 

(3)  La  lime  est  plus  petite  que  le  soleil. 

(A)  Le  soleil  est  plus  grand  que  la  lune. 

would  be  said  to  express  the  same  proposition,  and  to  be  true  because  they  all  express 
the  same  true  proposition. 

One  may  to  some  extent  distinguish  between  a traditional  logician’s  ujtion  of  a 
proposition,  and  more  recent  notions. 

1.2.1. 2.1  My  own  stereotypical  characterisation  of  the  traditional  notion  is  as 
follows: 

(1)  a proposition  is  an  abstract  object; 

(2)  there  are  non-denumer<ibly  many  propositions: 

(3)  propositions  exist  independently  of  language  (ie.  there  are  many  propositions 

which  are  not,  may  never  be,  and  perhaps  could  not  be,  expressed  by  a 
sentence; 

(A)  a proposition  is  something  which  is  itself  true  or  false  in  an  absolute  (timeless) 
sense; 

(5)  sorne  propositions  are  expressed  by  declarative  sentences; 


T7]  Rosenberg,  Jay  F.,  and  Travis,  eds.,  Charles,  "Readings  in  the  Philosophy  of  Language", 
Preniico-Hall,  New  Jersey,  1971,  pgs.219-220. 
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(6)  if  tv;o  sentences  ex'press  the  same  proposition,  then  they  are  analytically 

equivalent  (ie.  their  meanings  constrain  their  truth  conditions  in  such  a way 
that  they  are  either  both  true  or  both  false  in  any  situation,  state  of  affairs, 
or  "possibie  world"); 

(7)  however,  if  two  sentences  are  analytically  equivalent,  they  need  not  express  the 

same  proposition:  analytical  equivalence  is  less  stringent  a requirement  than 
identity  of  proposition  expressed  (thus  though  'John  is  a bachelor’  and  'John 
is  an  unmarried  adult  male  human  being’  are  analytically  equivalent,  we  do  not 
want  to  have  to  say  that  they  express  the  same  proposition); 

(8)  if  two  sentences  are  logically  equivalent,  i.e.  have  the  same  truth-values  in  all 

models,  then  they  need  not  express  the  same  proposition  (for  example,  'x=x’ 
and  'Fx  or  >-Fx’  are  logically  equivalent,  because  true  in  all  possible  worlds, 
but  we  do  not  want  to  say  that  they  express  the  same  proposition); 

(9)  propositions  are  what  are  believed,  doubted,  hoped  for,  etc.,  i.e.  they  are  the 

objects  of  belief,  doubt,  hope  and  the  other  so-called  "propositional  attitudes". 

1.2.1. 2.2  The  traditional  logical  notion  of  proposition,  though  it  can  be  rriade  precise 
to  come  degree  [8],  has  been  much  criticised  as  ontologically  unnecessary  and  obscure  [9]. 
Logicians  are  interested  in  what  is  true  or  false.  What  we  usually  term  true  or  false  are 
sentences  uttered  by  peo/Ae  in  certain  eontexts  and  interpreted  in  eertnin  v.nys  . It 
was  argued  that  to  postulate  the  existence  of  propositions  above  and  beyond  uttered 
sentences  was  simply  to  complicate  furthur  an  already  vexing  question  with  such  additional 
quandries  as  the  exact  nature  of  the  relationship  between  propositions  and  the  sentences 
which  "express"  them.  It  was  felt  that  the  real  problem  is  to  deterimine  the  nature  of  the 
dependency  of  the  truth  of  sentences  upon  the  context  of  their  use: 

"A  sentence  is  not  an  event  of  utterance^  but  a universal:  a 
repeatable  sound  pattern,  or  repeatedly  approxirnable  norrri. 

Truth  cannot  on  the  whole  be  viewed  as  a trait,  even  a passing 
trait,  of  a sentence  merely;  it  is  a passing  trait  of  a sentence  for 
a man.  'The  door  is  open’  is  true  for  a man  v/hen  a door  is  so 
situated  that  he  would  take  it  as  the  natural  momentary 
reference  of  'the  door’  and  it  is  (whether  he  knows  it  or  not) 
open.  The  individual  event  of  utterance  can  still  be  described  as 
true  absolutely,  since  a time  and  a man  are  specific  to  it;  but  talk 
of  sentences  as  true  for  men  at  times  covers  more  ground,  for  it 
includes  cases  where  the  sentence  is  not  uttered  by  the  man  in 
question  at  the  time  in  question. 


[8]  Cf.  Jan  Berg,  "What  is  a Proposition?",  Logique  et  Analyse,  Vol.  10,  Dec.  1967 
(summarized). 

[9]  Cf.  the  arguments  summarized  in  Howard  Pospesel,  "The  Non-Existence  of 
Propositions",  The  Monist,  Vol.  53,  April  1969  (summarized). 
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Relativity  to  times  and  persons  can  ot=  awkward  on  account 
of  the  supplementary  specifications  in  v/hich  it  keeps  involving 
us.  This  is  no  doubt  one  reason  why  philosophers  nave  liked  to 
posit  supplerrientary  abstract  entities  - proi>oaiiinus  - as 
surrogate  truth-vehicles. "[10] 

If  a person  utters  a sentence  in  a certain  manner  in  a ce-tain  place,  at  a certain 
time,  to  a certain  interlocutor,  in  brief,  in  a certain  context  and  "cote/t"  ^verb.nl  or  dinio-'U  .' 
context),  he  has  said  something,  niadr  n Mnieinent  , by  vsiiip;  a srnirnrc  in  u ri'rifiin  u<i\ 
[11].  Many  conteinporary  philosophers  regard  stateiiKnts  as  "primary  truth -boarer  ."; 
they  regard  statements  as  v/hat  are  (timelessly)  true  or  false.  Some  furthur  claim  Ihial 
statements  are  representable  by  oiornnl  sonh'nceit  , i.e.  sentences  the  values  of  all  of 
v/hose  indexical  terms  have  been  explicitly  specified,  v/hose  truth-value  consequently 
stays  fixed  through  time  and  from  context  to  context.  One  might  thus  think  of  a statemoi't 
as  a pair  comprising  a sentence  and  a coriiplcte  interpretation  of  that  senti.'ncc;  the 
interpretation  of  the  sentence  v/ould  ideally  provide  a complete  function  froir,  possible 
states  of  affairs  (possible  worlds)  to  truth-values  for  that  stateinent.  SucLi  a function,  in 
other  words,  vrould  be  a complete  and  precise  specification  of  the  truth-conditions  of  that 
sentence  as  used;  it  would  include  a set  of  specifications  of  truth-conditions  '.vhich  stein 
from  the  conventional  semantic  meaning  of  the  sentence  taken  just  as  a sentence  of  the 
English  (or  other)  language  it  belongs  to,  plus  complete  specifications  of  the  values  of  all 
the  indexical  terms  in  the  sentence,  given  tiie  context  of  use,  and  unambiguous  definite 
descriptions  of  the  referents  of  the  ambiguous  or  vague  definite  clescriptiOiTS  in  th.c 
sentence.  Such  interpretations  correspond  lo  what  some  formal  logicians  hr.'e  called 
intcnsioi'io  in  their  raodels.  Such  complete  specification  is  possible  by  fiat  in  tiie  domain  of 
formal  semantics.  It  is  a mioot  point  whether  such  a complete  specification  is  possible  for 
a natural  language  sentence  uttered  in  everyday  circumstances. 

It  is  important  to  note  that  philosophers  may  speak  of  propositions  being  expressed 
by  sentences,  and  of  statements  being  made  by  uttering  sentences,  but  they  do  not  tall  of 
sentences  denoting  or  referring  to  propositions  - except  in  one  case.  The/  do  talk  of 
nominalized  sentences  in  modal  or  intentional  contexts  as  being  "proposition-denoting 
expressions".  Consider  the  sentences  "It  is  impossible  that  Mary  is  sick"  and  "Jolsn 
believes  that  Mary  is  sick".  In  boih  of  these  cases  one  finds  the  nominalized  sentence 
"that  Mary  is  sick".  A traditional  analysis  of  the  logic  of  such  sentences  claims  tiiat  tlie 
expression  "tliat  Mary  is  sick"  is  an  expression  which  denotes  the  proposition  expressed 
(but  not  denoted)  by  the  sentence  "Mary  is  sick"  when  occunng  outside  such  "opaque" 
contexts  as  modal  or  (especially)  intentional  contexts. 


[10]  Willard  V.O.  Quine,  "Word  and  Object”,  The  M.l.T,  Press,  1960,  pgs.  191-192. 

[11]  For  important  articles  v/hich  make  use  of  this  terminology,  cf.  E.J.  Lemmon, 
"Sentences,  Statements  and  Propositions",  in  J.F.  Rosenberg  and  Charles  Travis,  eds., 
'Readings  in  the  Philosophy  of  Language’,  Prentice-Hall,  Engle.vood  Cliffs,  f'lev/  Jerso/, 
1971,  and  the  two  articles  by  P.F. Strawson  - "On  Referring",  Mind,  1950,  and  "identifying 
Reference  and  Truth-Vaiues",  Theona,  Vol.  XXX,  1964.  All  of  the  three  above-mentioned 
articles  are  summarized. 
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The  two  differences  between  the  linjuistic  and  the  logical  notion  of  ’proposition’  (or 
’statement’)  which  are  most  relevant  to  us  are  well  sumnriarized  by  Katz: 

"...[a]  point  about  our  [i.e.  some  linguists’]  use  of  the  term 
’proposition’...  is  that  the  class  of  propositions  cannot  be 
identified  with  the  class  of  statements,  where  statements  are 
understood  as  the  logical  objects  that  are  the  bearers  of  truth 
vaiues  (i.e.  as  the  objects  that  obey  the  law  of  the  excluded 
middle).  The  fact  that  our  characterization  of  the  class  of 
propositions  encompasses  a multitude  of  nonassertive 
propositions  (questions,  requests,  etc.),  for  which  it  maKes  no 
sense  to  talk  about  truth  and  falsity,  makes  this  amply  clear.  But 
we  cannot  even  identify  the  class  of  statements  with  the  class  of 
assertive  propositions,  since  a proposition  with  a token  indexical 
element  cannot  have  a fixed  truth  value."[lbid.,  pgs.  122-123] 


The  third  notion  of  'proposition’  which  one  can  pick  out  of  the  literature  is  that  of 
the  speech  act  theorists.  To  put  it  rather  vaguely,  for  speech-act  theorists,  the 

proposition  expressed  by  someone  who  utters  a sentence-token  is  wlml  i.e  lofi, 
syntnclir.nlly  and  xflinnnlicnlly,  in  llu'  uUorc.d  s^cnicnc.c.  token  after  nil  of  itx  comtitu.enls 
relevant  to  a determination  of  itn  illocutionary  foree  have  been  almtrneted  from  it  [12]. 
Let  us  examine  Searle’s  notion  of  proposition  which  is  of  this  type. 

"Irriagine  a speaker  and  a hearer  and  suppose  that  in  appropriate 
circurr[Stances  the  speaker  utters  one  of  the  follov/ing  sentences: 

1.  Sam  smokes  habitually. 

2.  Does  Sam  smoke  habitually? 

3.  Sam,  smoke  habitually! 

4.  Would  that  Sam  smoked  habitually. 


...anyone  who  utters  one  of  these  can  be  said  to  have 
uttered  a sentence  formed  of  words  in  the  English  language.  But 
clearly  this  is  only  the  beginning  of  a description,  for  the  speaker 
in  uttering  one  of  these  is  characteristically  saying  something  and 
not  merely  mouthing  words.  In  uttering  1 a speaker  is  making 
(what  philosophers  call)  an  assertion,  in  2 asking  a question,  in  3 
giving  an  order,  and  in  4 (a  somewhat  archaic  form)  expressing  a 
wish  or  desire.  And  in  the  performance  of  each  of  these  fou: 
different  acts  the  speaker  performs  certain  other  acts  which  are 
common  to  all  four:  in  uttering  any  of  these  the  speaker  refers  to 
or  mentions  or  designates  a certain  object  Sam,  and  he 
predicates  the  expression  "smokes  habitually"  (or  one  of  its 
inflections)  of  the  object  referred  to.  Thus  we  shall  say  that  in 

[12]  Cf.  John  R.  Searie,  "Austin  on  Locutionary  and  Illocutionary  Acts",  in  J.F.  Rosenberg 
and  C.Travis,  eds.,  op.cif.,  E.Stenius,  "Mood  and  Language  Game",  Synthese,  Vol.l7,  1967, 
Lennart  Aqvist,  "Semantic  and  Pragmatic  Characterizability  of  Linguistic  Usage",  Synthese, 
Vol.17,  1967. 
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the  utterance  of  all  four  the  reference  and  predication  are  the 
same,  though  in  each  case  the  same  reference  and  predication 
occur  as  part  of  a complete  speech  act  *//hich  is  different  from 
any  of  the  other  three.  We  thus  detach  the  notions  of  referring 
and  predicating  from  the  notions  of  such  complete  speech  acts  as 
asserting,  questioning,  commanding,  etc.,  and  the  justiiication  for 
this  separation  lies  in  the  fact  that  the  same  reference  and 
predication  can  occur  in  the  performance  of  different  complete 
speech  acts.  Austin  baptized  these  complete  speech  acts  with 
the  name  "illocutionary  acts"... 

Whenever  two  illocutionary  acts  contain  the  same  reference  and 
predication,  provided  tiiat  the  meaning  of  the  referring 
expression  is  the  sair.e,  I shall  say  .the  same  proposition  is 
expressed.  Thus,  in  the  utterances  1-5,  the  same  proposition  is 
expressed.  And  similarly  in  the  utterances  of: 

6.  If  Sam  srriOkes  habitually,  ho  v/ill  not  live  long. 

7.  The  proposition  that  Sam  smokes  habitually  is 
uninteresting. 

the  same  proposition  is  expressed  as  in  1-5,  though  in  both  6 
and  7 the  proposition  occurs  as  part  of  another  proposition. 

Thus  n i>roi>r,sitinn  is  to  ho  sharply  dislinr.^iishrd  front  an 
assertion  or  siatrinont  of  it  , since  in  utterances  of  1-7  the  saiiic 
proposition  occurs,  but  only  in  1 and  5 is  it  asserted.  Stating 
and  asserting  are  acts,  but  propositions  are  not  acts.  A 

proposition  is  what  is  asserted  in  the  act  of  assorting,  what  is 
stated  in  the  act  of  stating.  The  sarae  point  in  a different  v/ay; 
an  assertion  is  a (“ery  special  kind  of)  commitment  to  the  truth  of 
a proposition.  ... 

1 might  Summarize  this  part  of  my  set  of  distinctions  by  saying 
that  I am  distinguishing  between  the  illocutionary  act  and  the 
propositional  content  of  the  illocutionary  act."  [13] 

Searlc  proposes  an  analysis  of  uttered  sentence  tokens  v/hich  would  distinguish  bctv/ccn 
(a)  an  illocutionary  force  indicator,  representing  those  aspects  of  the  uttered  sentence 
relevant  to  the  determination  of  its  illocutionary  force  (such  as  the  presence  of  certau'i 
performative  verbs,  v/ord  order,  stress,  the  iriood  of  the  verb,  and  so  on),  and  (b)  a 
propositional  indicator,  representing  that  aspect  of  the  utterance  which  is  neutral  to 
illocutionary  force,  viz.,  the  proposition  expressed.  Tliis  propositional  indicator  might  also 
be  called  a "sentence  radical". 

We  have  a wealth  of  different  notions  of  'proposition’  to  choose  betv.'cen.  Rather 
than  debate  their  respective  values,  we  must  now  ask  what  notion  is  closest  to  that  v/hich 
the  team  would  like  to  investigate. 


[13]  John  R.  Searle,  "Speech  Acts",  Cambridge  University  Press,  1969,  pgs.  22-30. 
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1.2.2  THE  TEAM’S  NOTION  OF  (REPEATED)  PROPOSITIONAL  REFERENCE 

The  dialogue  analysis  team  is  at  present  seeking  to  formulate  a notion  of  proposition 
congruent  with  its  research  interests,  and  to  employ  that  notion  in  selecting  a set  of 
phenomena  which  it  feels  it  is  presently  ready  to  examine. 

I.2.2.1  The  team’s  notion  of  proposition.  No  description  exists  in  print  as  yet  of 
v/hat  the  team’s  concept  of  proposition  or  of  propositional  reference  might  be 
(propositional,  as  opposed  to  text  reference,  was  not  mentioned  in  the  "Observation 
Methods"  report).  The  following  remarks  are  therefore  based  on  discussions  I have  had 
with  members  of  the  team,  particularly  with  Jim  Levin. 

It  wouid  seem  that  the  team  is  moving  towards  a notion  of  proposition  which  is  much 
more  akin  to  that  of  Searle  and  of  some  linguists  than  to  that  of  the  logicians’.  The  toam 
is  employing  a notion  of  proposition  as  a theoretical  notion  employed  in  the  context  of  the 
modeling  of  dialogue  by  means  of  semantic  nets.  This  notion  is  to  be  understood  by 
reference  to  a certain  form  of  representation  of  utterances  in  a dialogue.  In  order  to  see 
this  more  clearly,  consider  the  utterances  in  (l)-(3)  below,  accompanied  by  one  form  of 
representation  which  the  toam  rriight  employ.  (In  these  examples,  we  suppose  that  Bill  is 
addressing  his  utterances  to  John.) 

(1)  Bill:  You  will  shut  the  door. 

(SAY 

(BILL 

TIME-1 

JOHN 

(SHUT 

(JOHN 

DOOR-1 

TIME-2)))) 

(2)  Bill;  Will  you  shut  the  door? 

(ASK 

(BILL 

TIME-1 

JOHN 

(SHUT 

(JOHN 

DOOR-1 

TIME-2)))) 

(3)  Bill:  John,  shut  the  door! 

(ORDER 

(BILL 

TIME-1 

JOHN 

(SHUT 

(JOHN 

DOOR-1 

TIME-2)))) 
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The  throe  representations  have  a cooHV.on  eloiv.ont,  namely,  (SHUT  (JOHN  DOOR-1  TIME-2)). 
This  common  element  is  very  siniilar  to  what  Searle  calls  a proposition:  it  is  a predication 
abstracted  from  different  illocutionary  acts  (and  not  only  from  statements  or  assertions). 
This  common  element  is  part  of  what  is  a proposition  expressed  by  a declarative  sontonce, 
e.g.,  in  the  case  of  (1),  the  proposition  expressed  by  a sentence  such  as  "Bill  asked  John  at 
tiiTie  tl  to  shut  the  door  at  time  t2".  This  matrix  sentence  or  proposition  contains  an 
illocutionary  verb  ("say",  "ask",  "order",  etc.)  and  thus  conveys  not  only  the  proposition 
expressed  by  the  common  element,  but  also  the  illocutionary  force  with  which  that  cominon 
element  v/as  produced.  Notice,  however,  that  the  representation  is  one  in  which  indcxicals 
are  filled  in,  and  in  which  the  referents  of  noun-phrases  in  the  utterances  arc; 
unambiguously  specified  (thus  DOOR-1  is  a GENSYM);  in  this  respect  the  representation  i., 
akin  to  the  logicians’  representations  of  statements. 

Given  this  form  of  representation,  one  might  define  proposition  in  one  or  more  of 
several  ways.  One  might  reserve  the  term  for  the  representation  of  an  act  of  uttenr.;;  h 
sentence  taken  in  its  entirety.  One  might  term  proposition  any  complete  representational 
unit,  i.e.  any  verb  and  its  arguments!  a proposition  in  this  sense  would  include,  of  course, 
propositions  in  the  first  sense  above.  Or  one  might  exclude  the  matrix  representation,  and 
Only  term  its  coraponents  propositions;  those  propositions  v;ould  then  correspond  to 
(nominalized)  sentential  clauses  in  the  utterances.  To  illustrale  these  possibilities,  let  us 
consider  the  utterance  and  its  representation  bclov;: 

Bill:  I am  sick,  and  I believe  that  I am  going  to  faint. 

(SAY 

(BILL 

TIME-1 

JOHN 

(AND 

(IS  BILL  SICK  TIME-1) 

(BELIEVE 

(BILL 

TIME-1 

(FAINT 

(BILL 

TIME-2] 

The  entire  representation,  [SAY  ...]  is  a proposition  in  the  first  sense.  Propositions  in  the 
second  sense  include  (a)  [SAY...],  (b)  [AND...],  (c)  [IS...],  (d)  [BELIEVE...]  and  (e)  [FAINT...]. 
Propositions  in  the  third  sense  include  only  (b)-(e),  and  not  (a). 

At  the  present  stage  of  discussion,  I can  only  throw  out  these  alternatives  for  the 
sake  of  debate. 

1. 2.2.2  The  team  currently  has  a clearer  notion  of  just  what  it  is  that  they  v/Ishi  to 
explore  under  the  heading  of  the  term  "repeated  propositional  reference"  than  they  do  of 
v/hat  they  wish  to  define  their  notion  of  proposition  as.  So  let  us  now  turn  to  a 
consideration  of  soine  dialogue  phenomena  v/hich  the  team  (a)  feels  am  inslances  of 
repeated  propositional  reference,  and  (b)  are  interested  in  investigating. 

The  following  is  a series  of  constructed  examples  of  dialogue  excerpts  which  the 
team  would  feel  involve  instances  of  repeated  propositional  reference  phenomena  of  a 
typo  currently  worthy  of  analysis. 
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(1)  A:  Mary  is  sick. 

B:  That’s  unfortunate. 

(2)  A:  I have  no  money.  Five  members  of  my  family 

are  deathly  ill.  I can’t  sleep  at 
night  because  I have  to  keep  fighting 
off  the  blood-thirsty  rats  in  my 
vermin-infested  apartment  block. 

B:  Your  story  is  not  so  different  from  the 
stories  most  other  people  in  the 
neighborhood  could  tell. 

(3)  A;  My  daughter  is  expecting  a child. 

3:  Yes,  but  it’s  a great  secret;  don’t  tell 
anyone  else  about  it. 

(4)  A:  John  said  that  Mary  is  sick. 

B:  Paul  told  me  that  piece  of  bad  news  yesterday. 

(5)  A:  John  said  that  Mary  is  sick. 

B:  If  what  he  said  is  true,  we  can’t  have  our 
picnic. 

(6)  A;  John  believes  that  Mary  is  sick. 

B:  Yes,  but  Paul  doubts  it. 

(7)  A:  John  believes  that  Mary  is  sick. 

B:  What  he  believes  is  true,  unfortunately. 

When  one  first  rapidly  glances  at  the  above  examples,  one  feels  that  they  are  similar  in 
that  in  each  dialogue  some  pronoun  or  noun  phrase  is  used  to  talk  about  something  which 
has  previously  been  talked  about  by  the  use  of  some  sentence(s)  or  nominalized  sentential 
phrase.  One  also  feels  that  this  similarity  can  only  be  specified  in  very  vague  terms,  as 
v.^as  just  done.  And  a closer  look  at  the  examples  shows  why;  there  is  an  extraordinary 
variety  of  things  going  on.  In  (1),  the  first  sentence  Is  used  to  describe  a state  of  affairs, 
v/hich  is  then  referred  to  and  commented  on  by  the  second  utterance.  In  (2),  the  first 
turn  involves  a description  of  a state  of  affairs,  and  the  second  utterance  comments  not  so 
much  on  that  state  of  affairs  as  on  the  description  which  was  made  of  it  (even  though  one 
v/ould  clearly  hesitate  to  say  that  a texi  reference  was  being  made).  In  (3),  the  first  turn 
involves  the  imparting  of  a piece  of  information  which  is  a description  of  a state  of  .iffairs, 
but  v/hich  is  commented  upon  as  a piece  of  information  in  the  second  turn  (the  information 
is  true,  but  its  a secret  - i.e.  it  is  a piece  of  information  which  has  not  been  imparted  to 
many  people).  In  (4),  indirectly  quoted  speech  is  reported,  and  then  is  said  to  be  identical 
v/ith  some  other  reported  speech;  (5)  again  involves  reported  speech,  but  the  statement 
v/hich  was  said  to  be  made  is  then  treated  as  a proposition  in  a "transparent"  context,  in 
(6)  and  (7),  an  Intentional  object  - a belief  - is  talked  about,  but  in  (6)  it  is  talked  about 
v/ithin  an  opaque  context  by  both  participants,  v/hereas  in  (7)  a belief  is  talked  about  once 
in  an  opaque  context  and  again  in  a transparent  context. 

This  heterogeneity  is  not  such  as  to  indicate  that  the  PREF  pheno(T,ena  v/hich  the 
team  currently  wishes  to  study  are  co'-ipletely  ill-defined,  however.  First  of  all,  the  team 
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is  well-united  on  the  decision  that  they  do  not  wish  to  study  PREF  phenomena  which 
involve  relationships  between  two  synonymous  sentences  used  in  a dialosuei  nor  do  they 
v/ish  to  study  under  the  heading  of  PREF  the  relationship  which  holds  between  two 
utterances  in  a dialogue  such  that  the  participants  who  uttered  them  thereby  "said  the 
same  thing"  or  made  the  same  statement.  Thus  the  following  pairs  of  sentences,  though 
they  are  instances  of  PREF,  are  not  of  current  interest. 

A:  Mary  is  sick. 

B:  Mary  is  ill. 

A:  You  are  hot. 

B:  Yes,  I am  hot. 

Secondly,  there  v/ould  seem  to  be  some  consensus  on  not  including  in  the  study  of  PREF 
an  investigation  of  the  relationship  of  co-rcferentiality  between  noun-phrases  which 
denote  what  we  would  ordinarily  regard  as  propositions  or  statements.  NP-MP 
co-reference  is  thus  outside  of  the  scope  of  current  PREF  analysis,  and  such  dialogues  as 
those  below  are  not  to  be  studied  under  that  heading: 

A:  John’s  assertion  is  simply  not  true! 

B:  I can't  see  v./hy  not.  His  claim  seems  vrell 

supported  by  all  the  available  evidence. 

A:  De  Morgan’s  law  is  a very  important  one. 

B:  Well,  his  theorem  has  certainly  been  useful. 

Thirdly,  there  are  certain  referential  phenomena  which  are,  intuitively,  quite  distinct  from 
vrhat  vre  are  groping  at  above.  One  is  reference  to  physical  objects;  another  is  rcferciue 
to  text  per  se.  A third  phenomena  which  is  distinct  is  reference  to  actions,  as  in  (S)  and 
(9)  below: 

(8)  A:  John  went  fishing  yesterday. 

B:  Mary  did  so  too. 

(9)  A:  Sky-diving  without  any  training  is  exhilarating. 

B:  It’s  also  foolish  and  suicidal.  [14] 

The  distinction  between  propositional  reference  and  action  reference  is  often  intuitively 
clear,  but  it  is  very  hard  to  formulate.  It  cannot  bo  pinned  down  in  syntactic  terms  alone. 

One  is  tempted  to  say  that  neither  the  "it"  nor  the  "so"  in  the  above  e/amples  are 

propositional  references  because  they  cannot  be  analysed  as  standing  for  sentences, 
nominalized  or  no.  The  second  turn  in  (8)  could  be  rendered  as  "Mary  did  go  fishing 

yesterday  too"  but  not  as  "Mary  did  J>  hn  wont  fishing  yesterday  too".  However,  the 

second  turn  of  (9)  could  be  rendered  as  "For  people  to  go  skydiving  v/ithout  any  training 
is  also  foolish  and  suicidal".  And  the  noun  phrases  or  pronouns  involved  in  PREF  often 

[14]  An  interesting  example  of  reference  to  an  action  which  ca.i  occur  in  dialogue  is 
reference  to  a performative  act  previously  performed  by  a participant,  as  in: 

A:  You’re  a bastard! 

B:  That’s  un-callod  for! 
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cannot  simply  be  replaced  by  sentences:  consider  "your  story"  in  example  (2). 

Given  the  above,  we  will  say  that  repeated  propositional  reference  of  the  type  the 
team  is  currently  interested  in  studying  occurs  in  a dialogue  when 

(1)  a sentential  or  multi-sentential  utterance  in  the  dialogue  is  subsequently 

mentioned  by  means  of  a pronoun  or  noun-phrase  in  such  a manner  that  v^hat 
is  being  mentioned  is  not  the  utterance  considered  purely  as  text  (i.e.  as  an 
uninterpreted  phonetic  or  graphic  token,  or  as  a member  of  some  class  of 
tokens). 

(2)  Some  sentence  uttered  in  the  dialogue  contains  a nominalized  sentence  which 

refers  to  a state  of  affairs,  a statement,  a reported  utterance  or  an  intentional 
object,  and  some  subsequent  pronoun  or  noun-phrase  is  co-referential  vyith 
that  nominalized  sentence. 


Syntactically  speaking,  then,  PREF  involves  only  sentences  and  nominalized  sentences  on 
the  one  hand,  and  noun-phrases  and  pronouns  on  the  other  (see  diagram  overleaf). 

The  above  delimitation  of  PREF  is  highly  criticlsable;  it  is  a description  by  exclusion 
(cf  (D),  and  contains  highly  problematic  terms  (e.g.  "state  of  affairs").  However,  it  is  the 
least  bad  proposal  I can  come  up  with. 

There  is  a terminological  problem  which  remains  to  be  dealt  with.  it  is 
unsatisfactory  to  use  the  term  "propositional  reference",  to  describe  a phenomena  which 
covers  some  phenomena  (those  described  in  (1))  which  do  not  involve  reference  in  the 
usual  sense  at  all.  I shall  continue  using  the  term  PREF,  but  only  until  a better  term  is 
found. 
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SECTION  II: 

DETECTION  PROCEDURES 
FOR 

TEXT  AND  REPEATED  PROPOSITIONAL 
REFERENCE 

In  this  section,  we  will  discuss  constraints  on  TREF  and  PREF,  as 
described  above,  which  mijjht  be  of  some  assistance  in  detecting  their 
occurence  by  analysis  of  dialogue  transcripts. 

Those  constraints  will  be  induced  from  an  examination  of  dialogue  sarriples  in  which 
TREF  and  PREF  appear  to  occur.  The  examples  will  be  of  three  types;  some  v/ill  be 
examples  of  real  dialogue,  sonio  will  be  examples  of  dialogues  drav/n  from  literary  'worhs, 
such  as  plays  or  short  stories,  and  some  will  bo  examples  which  have  been  made  up 
(constructed)  to  illustrate  a point  or  a difficulty.  Each  example  provided  v/ill  be  marked  by 
an  R,  an  L or  a C,  according  to  its  source. 

The  first  observation  which  one  makes  when  one  considers  actual  dialogue  is  that 
TREF  is  much  less  frequent  than  PREF.  Since  TREF  is  more  of  an  exceptional  phcnomcnri, 
v/e  v/ill  examine  it  first,  with  the  hope  that  occurences  of  it  may  be  signaled  more 
explicitly  than  are  occurences  of  PREF. 


II.  1 THE  DETECTION  OF  TEXT  REFERENCE 


As  we  have  defined  or  described  it  above,  TREF  always  involves  the  use  of  at  lof.t 
one  expression  to  refer  to  text.  A text-referring  expression  (TREXPR)  [15]  may  be  an 
ordinary  noun-phrase  ("his  name",  "what  John  wrote  on  the  board"),  or  may  be  nn 
expression  which  refers  to  text  by  cither  exhibiting  or  replicating  it.  So  if  we  v/ish  to 
detect  instances  of  TREF,  we  must  (a)  find  ways  of  determining  whoti  er  an  expression  is 
being  used  as  a TREXPR. 

But  once  we  have  determined  that  a given  expression  is  a TREXPR,  wc  are  not 
thereby  assured  that  we  have  before  us  an  instance  of  TREF  as  v/e  have  conceived  of  it. 
For  a particular  TREXPR,  say  El,  lo  be  involved  in  TPEF,  one  of  the  following  two  cases 
must  obtain,  (i)  There  is  another  token,  say  E,  used  in  the  dialogue,  which  is  not  a 
TREXPR,  and  which  is  either  identical  with  or  a member  of  the  referent  of  El.  (ii)  There  is 
another  TREXPR,  say  E2,  used  elsewhere  in  the  dialogue,  whose  referent  is  either  identical 
v/ith,  a subset  of,  or  a member  of  the  referent  of  El.  Thus,  once  v/e  have  detected  the 
presence  of  a TREXPR  in  a dialogue  (a),  we  must  (b)  compare  its  referent  to  other 
non-TREXPRs  in  the  dialogue  and  to  the  referents  of  other  TREXPRS  in  the  dialogue,  if 
there  are  such,  and  (c)  decide  on  the  basis  of  this  comparison  v/hether  a TREF  is  occuring, 
and  if  so,  what  other  expression  is  involved. 

The  above  suggests  an  outline  of  a procedure  for  detecting  TREF: 

1.  Find  all  occurences  of  TREXPRS. 

2.  For  each  TREXPR; 

[15]  For  the  remainder  of  section  II,  unless  otherwise  specified,  we  v/ill  use  the  term 
"expression"  to  mean  expression-token. 
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2.1  Draw  up  a list  of  candidates  for  comparison 

with  the  TREXPR.  This  list  will  include 

- other  TREXPRs,  if  present 

- "suitable”  tokens,  not  TREXPRs, 

^ ^ _ which  are  used  in  the  dialogue. 

2.2  Compare  the  TREXPR  with  the  candidates. 

- If  the  candidate  is  a TREXPR,  determine 
whether  its  referent  is  identical  with, 

a subset  of,  Or  a member  of,  the  referent 
of  ine  TREXPR  under  study. 

- If  the  candidate  is  a non-TREXPR,  detemine 
whether  it  is  identical  with  or  a member  of 
the  referent  of  the  TREXPR  under  study. 

If  no  comparisons  suceed,  conclude  that  no  TREF  is 
occuring.  ■' 

If  only  one  comparison  succeeds,  return  the  pair 
of  successfully  compared  expressions  as  the 
TREF  which  is  occuring. 

If  several  corr,parisons  succeed,  continue. 

2.3  Apply  come  evaluative  criterion  (or  criteria)  to 

the  pairs  of  expressions  which  have  been  successfully 
compared.  If  one  pair  is  clearly  a "best"  match, 
return  it  as  the  TREF;  if  several  pairs  are  almost 
equally  "good",  return  the  TREXPR  under  study 
along  with  all  other  members  of  these  pairs  as 
the  (multiple)  TREF  which  is  occuring. 


With  this  vaguely  defined  procedure  in  mind,  let  us  turn 
corpus  of  examples  of  text  reference  (listed  overleaf). 


to 


a consideration  of  a 


Th„  ...It’'  “'"I''  '“"’‘‘If  <iialosue  examples  which  are  eilher  real  or  literary  (licliohal) 
I.Ih.ir  are  either  examples  ol  wriltcn  dialoaue  or  ol  oral  dialo-ues  which  were 

Sdb  equenlly  traosenb.d;  the  literary  examples  were  written.  We  will  „ve  lesl^  wemht  o 
contirmatioh  ol  procedures  by  transcribed  exaraples  when  those  procTdurerrely  IIo' 

10  “his  Is  IhlTortho  "h"’“  " 'abt'aliaation).  The  reason 

kaloele.  bit  1 thi  T Pa'ticipants  in  the 

^ ^ transcriber,  and  are  the  result  of  a decision  by  the  transcriber  that 

«»oni6  express  on  was  bejn®  usod  a tppvpd.  t ..i 

ca-es  is  a Vrrn  nt.t’  f r,  lu  ^^ly  upon  orthographic  cues  in  such 

ca.es  a cop-out  from  the  point  of  view  of  someone  who  wishes  to  to  specify  computer 
programs  which  will  detect  TREF  independently  of  human  judgments. 

which  wHI  ThL'br'fnV^wT^'^^  subprocedures  or  criteria  for  each 

Dh  aej  Lh  h ‘0  subsequent  examples.  We  will  at  first  only  deal  with  noun 

do'wilh  pIonouT“  ■ ‘■"■'r-'i  '» 


Lot  us  bosin  with  example  (1)  below.  (In  all  of  our  examples  we  will  italicise  the 

:ur“'e“„7dt:isir„.'“'' 
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L:  Yc5,  I linKcd  to  PARC-MAXC  and  transferred  a file  - but  couldn’t  run 
it  at  PARC-MAXC  - ll)F/]S?(i) 

0:  What’s  IDF/\S?  (2) 

L:  Sorry,  mistyped  - ideas? 


(REAL  - WRITTEN  - SOURCE:  0C32.PR0T0C0L) 


Wo  intuitively  perceive  the  second  occurence  of  "IDFAS",  (2),  as  a TREXPR.  This 
su^oests  that  we  regard  expressions  which  are  not  in  our  lexicon  either  as  words  of 
English  or  as  names  as  TREXPRs;  but  this  principle  is  wrong,  because  it  would  lead  us  to 
mark  the  first  occurence  of  IDFAS,  (1),  as  a TREXPR  also,  wheras  we  clearly  percieve  it  to 
be  a sirriple  expression  (albeit  a rrastyped  version  of  an  expression)  which  L use;.. 
Sirsilarly,  simple  capitalization  is  not  a cue,  for  both  occurences  of  "IDFAS"  are  capitalized. 
What  seems  to  indicate  to  us  that  (2)  is  a TREXPR  is  not  only  that  it  is  an  unrecognized 
syrnbol,  but  that  it  is  the  subject  of  a question.  So  this  leads  to  the  formulation  of  a 
principle  of  TREXPR  detection  (TD); 

(TDl.l) 

If  an  expression  is  not  in  the  lexicon,  then 
if  it  is  the  subject  of  a question, 
it  is  a TREXPR. 

This  principle  in  the  case  of  example  (1)  allows  us  to  conclude  that  there  is  one  and  only 
one  TREXPR.  What  are  candidates  for  comparison  with  it?  Intuitively,  we  perceive  that 
there  is  only  one  non-TREXPR  candidate:  the  first  occurence  of  "IDFAS",  (1),  ’which  is  a 
token  of  the  type  of  the  TREXPR  (2)  (this  is  an  instance  of  TREF  by  replication).  This 
loads  us  to  formulate  the  following  principle  of  candidate  selection  (CS): 

(CSl.l) 

If  there  is  a non-TREXPR  expression  in  the  dialogue 
which  is  orthographically  identical  with  the 
TREXPR  under  study,  then  it  is  a candidate. 

Since  there  is  only  one  candidate,  our  procedure  returns  the  two  occurences  of  "IDFAS"  ar. 
a TREF. 

Let  us  now  consider  example  (2)  below: 

L:  Guess  what  ...  that  didn’t  work  either.  It  took  "'/’’n  " (j)  to  be  a 
filename. 

0:  Did  you  say  TTY  (2)  or  TTY:  (3)  ? 

L:  Just  TTY.  (4) 

0:  if  you  append  the  colon,  then  it  will  be  recognised  as  a device 
desi.gnator  instead  of  file  name  (I  hope). 
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What  arc  the  TREXPRs  here?  Principle  TDl.l  picks  out  expressions  (2)  and  (3).  But 
intuitively  we  know  that  the  quotation-expres.  on  (1)  is  a TREXPR,  because  it  is  explicitly 
quoted.  Wo  wo  have  a new  TD  principle:  ^ ^ 

(TD2.1) 


If  an  expression  is  enclosed  in  quotation-marks, 
then  the  quotation-expression  (i.e.  the 
quotation-marks  plus  what  they  enclose) 
is  a TREXPR. 


We  still  do  not  have  sufficient  TD  principles,  however,  for  intuitively  we  perceive  the 
occurence  of  TTY”  in  L’s  second  turn  (4)  to  be  a TREXPR  also,  and  neither  TDl  nor  TD2 
would  mark  it  as  such.  I feel  that  I recognise  that  token  to  be  a TREXPR  for  two  reasons- 
(a)  It  IS  an  expression  which  is  not  in  the  lexicon,  and  (b)  it  is  said  to  have  been  uttered  by 
someone,  viz.  L (L’s  utterance  is  elliptical,  but  expandable  into  "I  just  said  TTY.”)  In  all  of 
the  constructed  sentences  below,  one  would  detect  a TREXPR: 


John  said/is  saying/says/will  say  blurpagg. 

Mary  shouted/is  shouting/shouts/will  shout  ARRGGHH. 

I wrote/am  writing/write/will  write  Xuytmon. 

Note  that  linguistic-reception  verbs  have  the  same  effect  as  these  linguistic-production 
verbs  ( hear  as  well  as  “say").  So  we  modify  TDl  as  follows- 
(TD1.2) 


If  an  expression  is  not  in  the  lexicon,  then 
if  it  is  the  subject  of  a question, 
it  is  a TREXPR; 

if  it  is  the  object  of  a verb  of  linguistic 
production  or  reception, 
then  it  is  a TREXPR. 

So  now  our  principles  allow  us  to  recognize  four  noun-phrase  TREXPRS:  (1)  in  turn  1,  (2) 

-in™  CSl.l,  we  find  that  there  are’  no 

non-TREXPR  candidates.  So  we  are  loft  to  take  each  of  the  four  TREXPRs  in  turn  and 
compare  them  with  the  three  others. 


We  percieve  (4)  to  be  co-referential  with  both  of  the  other  occu'-ences  of  "TTY",  and  we 

formulate  the  following  principle  of  co-referentiality  of  TREXPRS  (CR)  to  account  for  that 
fact: 

(CRl.l) 

If  there  are  two  TREXPRs,  El  and  E2,  and  one  of 
them,  say  El,  is  a quotation-expression,  then 

if  E2  is  orthographically  identic,  with 
the  quotation-content  (i.e.  the  string 
between  the  quotation-marks)  of  El, 

then  El  and  E2  are  co  -referential. 


We  perceive  the  first  and  second  occurences  of  "TTY",  (2)  and  (4).  to  be  co-referential, 
which  can  be  explained  as  foilows: 

(CR2.1) 

If  there  are  two  TREXPRs,  El  and  E2,  and  they  are 
orthographically  identical, 

then  they  are  co-referential. 

^1».°  of  ‘he  occurences  of  "TTY"  to  be  co-referential  with  each  other  and 

with  '"TTY"".  This  can  be  seen  as  a logical  consequence  of  the  fact  that  co-referentiality 
is  an  equivale.nce  relation.  One  would  be  wrong  in  so  concluding,  however,  for  one  must 


roforcnis.  And  ol  course  il  x pr  " ■uj'iy”’ 7 memder-set  relations  oi 

subset/memder  ol  z,  ir does  not  lollop  , t 7 h'  °'  ’ “n,  ^ “ 

subiet-superset/member-'-et  rpiatn  » u ^ either  identical  or  bear 

adopt  the  four-.:  ,c  h^riotic  w th  full  k^n  Ti’  I will  temporarily 

fact  that  its  falsity  may  nol  b^^  revealed  in  r^ost  d!alo 'ues  " 

(CR3.1)  u'aio^ue.u. 

For  any  three  TREXPRs.  £1.  £2,  and  £3,  if  £1 
and  £2  are  both  co-referential  v/ith  £3,  then 
El  and  £2  are  co-referential  with  each  other. 

This  lillols‘'L"rrth?a‘bovea!p°^  "““"-P'''"®  TREXPR. 

and  rccep?io7r„  delecrTREr  ctfai'"’"" 

exaippple  (3)  Pelolp  " P»=p"^'p  Consider 

'"‘"ihea'd'  “°  "”'  f""™  *)  Co 


/name  */  (1)  ...'''■'w->,^.»(,nterrupt  herel^'vwww 


....  We  would  like  to  unarchive  these. 


0:  In? 


Lp  We  are  in  directory  [,-,ame  3)  but  the  tapes  were  archived  Iron,  the 
[name  41  directory.  Go  ahead. 

Ok,  but  you  will  have  to  give  me  those  unines  (2)  acrain... 


(REAL  - WRITTEN  - SOURCE:  0C636.PR0T0C0L) 


Wo  perceive  expression  (2)  to  be  a TR^XPR  because  of  the  meanin^r  of  the  word  "naive".  . 

rLii^iirrVhScr •“  - -pv; 

(TD2.1) 

If  an  expression  is  a rne-nber  of  the  set  of  conventional 
TREXPRS  (C-  r.R.£XPRS),  then  it  is  a TREXPR. 

(TD3. 1 ) 

...  ®f  C-TREXPRS  is  <name{s)>. 

Vve  also  perceive  the  expre' „ ^n  (i)  to  bp  a TRpypo  tu  / 

th^ci'tj;  the  indirect  object  of  the  verb  "cal,",  which  th^  fdlc^in"  1:"::: 

(TD4.1) 


If  any  expression  is  the  indirect  object  of  the  verb 
, call  . then  it  is  a TREXPR. 

The  p.„c,pl«  TD3  and  T04  see„  ,o  Pe  p„„a  o„,  p, 

A.  Our  president  calls  us  „/  ,i) 

"T.lZ7Z.ll  a)T  “ ‘ 

At  Gedrse  Whipple?  No.  , Knew  a Ceer.,e  Cnllou„r  (2). 

B:  This  is  George  Whipple. 

A:  I don’t  recall  t/w  name  (4). 


(LITERARY  - WRITTEN  - SOURCE:  THE  TRAVELLOR) 


We  can  see  that  (3)  and  (4)  are  TREXPRs  using  TD4  1 (4)  of  ro  ,r.tAs 

Proc^essing  to  establish  that  ’’by  the  na.e  of  X”  introduces  ’a  textual  obj^cl'x:;^'^"’ a 

are  TR^^EXPranr^S^r^ttf  -^^‘^er  they 

acsociated  with  in  occurences  of  TREF  rtf  ii'  TREXPR  noun-phrases  arr 

example  (4),  we  shou  d bemn  to  B’s  second  turn  ,n 

it  is  useful  io  consider  cert  m H?.r, 

are  useful  in  cases  of  repeated'reference^^  ^'^l^cedents  of  pronouns  which 

propositions.  We  shall  only  be  concerned  rero°wrth“I™'a|  "Le'‘l?th'e'tte ^ "I 

Its  of  the  pronouns  'if.  -that-  „n  -,p“;.  "o'ocrmnat.on  of 


vwMv.t;fnt;u  nere  w 

ine  antecedents  of  the  pronouns  "it"  "that"  nnH  "n,-  » ■ a 

roforred  to  b,  the  pronouns  ••pe-  or  "(h:-;Lcte':' e;-rco'^:id' rL't  "" 

(GRl.I)  IF  A PRONOUN  = "IT",  "THAT"  OR  "THIS": 

" rente,jcV7h'errr'°“’ 

co-referentX  ‘ 

“ thl'  in"  i"  ‘’^""‘''"8  <»l'Pce  n is  less 

rr  or,lv^!l';r""’  ",  "■  > endidate 

priority  n+1  for  co-referentiality. 

" TreTnrdate.”"''''"'"  — P>-rases 


never 


(a)  Noun  phrases  within  same  sentence  have  a hi=>her 
priority  than  noun  phrases  in  preceding  sentences. 

(b)  Noun  phrases  in  nth  preceding  sentence  have  lower 
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priority  than  noun-phraees  in  mth  preceding  sentence, 
where  n and  m are  both  less  than  some  parameter  I 
and  n<m. 

(c)  Within  a sentence,  noun  phrases  in  subject  position 
have  a higher  priority  than  noun  phrases  in  object 
position;  noun  phrases  in  object  position  in  turn  have  a 
higher  priority  than  non  phrases  ‘n  prepositional 
phrases. 

(d)  Within  a sentence,  noun-phrases  v/ithin  a main 
clause  have  a higher  priority  than  noun-phrases  within 
a subordinate  clause. 

(e)  Within  a sentence,  focused  noun-phrases  have  a 
higher  priority  than  noun-phrases  which  are  not 
focused.  (Focused  noun-phrases  are  those  moved  to 
the  front  of  sentences  by  such  transformations  as 
extraposition  or  tough-movement.) 


(GR2.1) 

Candidates  for  co-rcferentiality  are  to  be  selected  by  comparing  the 
follov/ing  features  or  predicates  of  the  pronoun  v.  ith  those  of  the 
caitdidate:  (a)  sex  and  number  (b)  case  or  type  constraints.  If 
such  considerations  fail  to  select  one  candidate,  rely  on  more 
specific  plausibility  considerations.  [16] 

Let  us  see  how  these  general  rules,  in  conjunction  with  the  rules  we  have  specified 
so  far,  allov/  us  to  detect  TREF.  Consider  first  the  follov/ing  example  (5): 


A:  You’ve  surely  heard  me  speak  of  I'M/rciw  Tc.sU  (1)1 
B:  I can’t  say  that  I have. 

A:  Well,  his  name  (2)  is  always  in  the  newspapers;  he’s  a dramatic  star. 
Everyone  I Know  would  recognize  U (3). 

B:  I am  not  familiar  with  the  of  dramaiir  stars  (4).  I have  never 

seen  it  (5)  before. 

(CONSTRUCTED) 


[16]  These  general  rules  were  suggested  by  Jim  Levin,  on  the  basis  of  his  ov/n  work  and 
the  v/ork  of  others  in  the  Al  field.  I do  not  claim  that  he  would  endorse  the  form  that  I 
have  given  them  here. 
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fh^e  "^^Sene  Tesh"  is  marked  as  a TREXPR  because  it  is  not  in 

and  TD3  (i)  ‘ ° expression  (2),  which  is  marked  as  a TREXPR  by  rules  TD2 

when  \ in  a TREXPR.  Now  we  understld  that 

informahnn^""^’  ° ™de  by  the  introduction  of  his/her  name,  two  pieces  of 

oartimilar  o b<^ars  the  relation  name-of  to  that  individual.  And  in  this 

tn  th"  ■ H-  w understand  that  (2)  is  a reference  to  the  name  "Eugene  Tesh"  and  not 
individual  named  Eugene  Tesh.  We  might  capture  this  by  the  following  crude  rule: 

If  there  are  two  TREXPRs,  one  of  which  is  or  contains 
t^he  word  "name"  or  "names",  and  the  other  one  of  which 
has  been  interpreted  as  referring  to  an  individual  by 
name,  then  they  are  co-referential. 

We  are  supposing  that  the  phrase  “interpreted  as  referring  to  an  individual  by  name"  Iv. 

coi-e  tTtT'"”  syntactic  and  semantic  programs  yet  to  be  specified.  When  wo 

M he  pronoun  (3),  the  general  rules  comes  into  play.  According  to  these  rulo<-  the 

noun  pnr<itiCs,  r»incc  thcrs  nn  *'41  • • .1 

u ^ that  s or  this  c in  the  precedinc'  ^.entenrrT 

CR4  wn  M t dramatic  star  ).  And  (2)  would  be  selected.  When  we  come  to  (4) 

CO  rpf<.r  I r co-referential  with  (1),  and  CR3  would  mark  (1),  (2),  (3)  and  (4)  a-’ 

would  a-"ain  chain  a antecedent  of  (5),  and  CR3 

/rouid  again  chain  (l)-(5)  together  as  co-referential. 

A:  Hne!  The  other  one  is  a U'.N.  (1)  down  at  Permanente  on  Sunset 
Diva. 


B:  /,.K(V.  (2)  ...  T/iai’s  (3)  lanky  vertiginous  nurse? 
A:  Right. 


(REAL  - ORAL  - SOURCE:  BLIND-DATES.PROTOCOL) 


Our  rules  would  mark  (1)  and  (2)  as  TREXPRs,  by  TDl. 
co-referential,  by: 

(CR5.1) 


We  perceive  (1)  and  (2)  to  be 


If  two  TREXPRs  are  not  in  the  lexicon,  then 

if  they  are  orthographically  identical, 
tlicn  they  are  co-referential. 

t^e^hrtf  T'^  ! pronomina.  expression  (3),  the  general  rules  would  mark  (2)  as  bein-^ 
conr  1 acceptable  candidate  for  being  (3)’s  antecedent.  Note  that  this 


A further  example  showing  the  additional  development  of  the 
following,  example  (7): 


general  rules  is  the 
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A:  1 wonder  if  you  know  a conductor  by  fho  name  of  Cror,o 

B:  George  Whipple?  No.  I knew  a GW^e  Cnllowuy  (2). 

A:  This  is  George  Whipple. 

B:  I don’t  recall  tf,r  name.  (3). 


(LITERARY  - WRITTEN  - SOURCE:  THE  TRAVELLOR) 

Sm.  in  t "erpJita'tion'of  """  ^ P--.cor 

and  (ii)  introduce  a textual  obicct  an  individual  named  George  Whipple 

that  individual  Similarly  for  (2)  ’ R » Whipple  , which  bears  the  relation  namo-of  to 

"this"  in  "This  IS  Georc?^  ^ pronoun 

turn,  and  not  to  his  name  Our  individual  referred  to  in  the  first 

Galloway"  and  if  thev  ri  i ' probably  bind  "this"  to  "GeorTT,.; 

conclusion  that  George  Galloway^is  implausibility  of  the  resullant 

"George  Whipple".  ^ 'he  nair,-. 

cue..  or;REE°.'  Co."’:e: 

ahead"”'  " questions  about  "runoff"  to  on-line.  Go 


°''''abn„r''T  ^°‘ii"  I don’t  know  too  much 

about  It,  we  II  see  what  I can  find.  Hold  a -.ec  ...  O.K.  Shoot. 

' '"my  TTY  on'hne.! 


(REAL  - WRITTEN  - SOURCE:  0C3 70. PROTOCOL) 


Sbw\ 

and  eliminale  mislakU  ' 1'='  'P'' 
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Secondly,  there  is  the  problem  of  TREXPRs  which  contain  variables,  An  example 
would  be  "When  making  a new  connection  type  ATT  LINKER  (PASSWORD),  job  nurriber". 
Here  only  the  "ATT"  is  a replication  of  its  reforonti  "LINKER"  stands  in  for  a variable  otririf;, 
or  a variable  class  of  strings.  Logicians  have  wrestled  with  the  problem  of  variables 
within  quotation  contexts  because  they  needed  to  be  able  to  have  variables  ranging  over 
text  when  constructing  truth  definitions;  there  are  considerable  problems  here. 

Thirdly,  there  is  the  problem  of  the  use  of  quotation-marks  in  irony,  the  use  of 
so-callcd  snigger  quotes.  Jim  Levin  has  suggested  v/hat  is  undoubtedly  the  right  approach 
to  such  problems:  regard  quotation-marks  in  general  as  a signal  that  some  peculiarity  in 
processing  the  quoted  words  is  required.  The  most  frequent  peculiarity  thus  signaled  is 
that  the  words  themselves  should  be  retained,  but  other  peculiarities  should  be  allo'.ved 
for.  For  example,  quotation-marks  may  signal  that  a word  is  being  used  in  a sense  very 
different  (perhaps  opposite)  from  that  in  which  it  is  commonly  used,  or  that  some 
presupposition  of  the  use  of  that  word  is  not  obtaining.  To  return  to  example  (S)  above, 
for  instance,  it  is  possible  that  the  user  was  quoting  "runoff"  to  show  that  though  the  term 
usually  is  supposed  to  denote  a program  that  runs  off  formatted  copies,  he  is  unwilling  to 
use  it  in  that  way,  because  his  experience  leads  him  to  believe  that  the  program  in 
question  obstinately  refuses  to  run  off  copies! 

Despite  these  difficulties,  I believe  that  one  might  vyell  be  able  to  devise  a set  of 
heuristics  v/hich  would  detect  TREF  correctly  in  a satisfactory  number  of  cases.  TREF  is 
on  the  whole  much  less  intractable  than  i^REF,  to  which  we  now  turn. 
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11.2  THE  DETECTION  OF  REPEATED  PROPOSITIONAL  REFERENCE 


Our  general  approach  to  the  detection  of  repeated  propositional  reference  will  bo 
largely  similar  to  our  approach  to  the  detection  of  TREF.  That  is,  v/e  will  first  look  for 
certain  proposition-referring  expressions,  PREXPRs,  and  then,  once  PREXPR(s)  have  been 
found,  look  for  other  expressions  which  may  bo  involved  with  those  PREXPR(g)  in  repeated 
propositional  reference.  Lastly,  once  we  have  a PREXPR  and  a list  of  "candidates",  we  v/ill 
select  candidates  by  some  criteria. 

There  will  be  important  differences,  however.  PREF  involves  (a)  a noun-phrase  or 
pronoun  on  the  one  hand,  and  (b)  a sentence  or  norninalised  sentence,  on  the  other.  Mow 
the  first  conclusion  one  reaches  when  one  considers  instances  of  PREF  is  that  jnst  nlmni 
any  snm’ttrr  or  mmiunlitrd  srntrtwr  is  rnimlilr  of  imriir.ifxiiina  in  n HREI'\  As  a result, 
it  is  not  functional  to  first  pick  a sentence  Or  nominalized  sentence  and  then  look  for  a 
noun-phrase  or  pronominal  candidate  - there  would  simply  be  too  much  useless  processing 
involved  in  such  a procedure.  Rather,  one  must  first  find  a noun-phrase  or  pronoun 
which,  because  of  its  meaning,  grammatical  position  or  features  (i.e.  because  of  what  is 
predicated  of  it)  is  suceptible  of  being  involved  in  a PREF,  and  then  look  for  sentence  or 
nominalized  sentence  candidates. 

Belov/,  we  will  first  analyse  examples  of  PREF  which  involve  noun-phrases,  and  then 
look  at  examples  of  PREF  which  involve  pronouns. 

First,  let  us  consider  example  (1)  below: 

A:  You  know,  I just...  Tho  second  riucstion  (1)  would  be:  uihy  uusn't  this 
done  he.fore  / uietil  throufrh  nil  these  hone,  scans,  thyroid  .scans, 
and  you  know...  (2) 

B:  Well,  that’s  a... 

A:  I’ll  die  of  radioactivity. 

B:  Yes,  that’s  the.  logical  riuestion  (3)  and... 


(REAL  - ORAL  - SOURCE:  MEDICAL-CENTERED.PROTOCOL) 


We  see  intuitively  that  "question"  is  a noun-phrase  which  refers  to  v/hat  we  v/ould  call  a 
proposition,  so  that  both  (1)  and  (3)  are  noun-phrases  which  might  be  involved  in  a PREF. 
Other  noun  phrases  are  similar  to  "question"  in  this  respect,  for  example  "statement", 
"request",  "order",  "demand",  "query"  and  so  on.  This  suggests  the  follov/ing  principle  for 
detecting  PREXPRs  (PD): 

(PDl.l) 

If  an  expression  is  a member  of  the  set  of 
conventional  proposition-referring  expressions 
(CPREXPRs),  then  it  is  a PREXPR. 

(PD2.1) 
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The  set  of  CPREXPRs  is  <question(s),  request(s), 
order(s),  clemand(s),  asserflon(s),  claim(s), 
query(ies)....> 

We  also  see  that  (2)  is  co-referential  with  both  (1)  and  (3),  because  it  is  an  expression  of 
the  interrogative  form.  What  we  are  relying  on  here  is  a syntactic  requirement  which 
derives  from  the  meaning  of  the  noun  "question".  Certain  other  CPREXPRS  impose 
syntactic  requirements  upon  candidates,  for  example,  "advice"  in  the  following  example  (2): 

A:  And  now  I shall  give  you  nn  nxlrn  pirrn  of  ndvic.r.  (1).  Slop 
disgrnciiifr  your  dau.(>ltirr  u:ilh  your  oompnity  on  ihr  sirccis  - 
and,  ahovr.  all,  ai  thn  ihnatrr...  (2)  or  she  will  soon  have  every 
door  to  advancement  shut  to  her! 


(LITERARY  - WRITTEN  - SOURCE:  MOTHERLOVE) 


"Advice"  is  a CPREXPR,  and  we  know  that  (2)  is  a candidate  for  PREF  because  it  is  in  the 
imperative  mood.  A third  example  of  a dialogue  in  which  syntactic  cues  deriving  from  tlie 
meaning  of  a CPREXPR  are  used  in  determining  PREF  is  (3)  below: 

A:  General,  I only  want  to  keep  one  little  private  letter.  Only  one.  /.<?/ 
me  have  it.  (1) 

B:  Is  that  a rea.^onabh  demand  (2),  madam? 


(LITERARY  - WRITTEN  - SOURCE:  THE  MAN  OF  DESTINY) 


Here  again,  we  see  that  (2)  is  co-referential  with  (1)  because  demands  are  (usually  - I am 
being  sloppy  here  of  course)  expressed  by  means  of  imperative  sentences.  One  may 
therefore  adopt  the  following  heuristic: 

(TD3.1) 

Given  a CPREXPR,  if  that  CPREXPR  denotes  a type  of 
proposition  v/hich  is  usually  expressed  by  a sentence 
of  a certain  grammatical  mood  (declarative,  interrogative, 
imperative,  hoi  artory,  etc.),  then  any  nearby  sentence 
or  clause  which  is  of  that  mood  is  to  be  considered  as  a 
PREXPR  cadidate. 

We  have  included  the  phrase  "sentential  clause"  in  TD3.1  because  v/e  want  to  account  for 
cases  like  "Sam  is  curious  to  know  v/hether  or  not  the  Socialists  will  take  over  the 
Portugese  government.  - That’s  a good  question".  In  such  cases  one  finds  that  embedded 
questions  participate  in  PREF. 

Many  instances  of  PREF  involving  noun-phrases  require  a pretty  cornplete 
understanding  of  the  meaning  of  the  noun-phrase  in  question  in  order  to  select  candidates. 
I will  give  three  examples  below.  First,  example  (4): 

A:  Just  a passing  eomment  (i),  Joe.  We’re  having  lunch  right  now,  and  I 
just  made  myself  a hotdog  sandwich  with  catsup.  Very  lasiy  and 
almo.u  unheard  of  in  the  old  days.  (2) 
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(REAL  . ORAL 

CORRECTION-ACTIONS-CORPUS.PROTOCOL) 


SOURCE: 


(I  v/ill  suppose  that  the  ellipsis  in  (2)  has  been  filled  out.)  In  decidin=  that  (2)  rather  th.e 

upon  o^u7  u'^'j  for  PREF  involvement  with  the  CPREXPR  (1),  we  rel- 

upon  our  Undei  stalidlim  o w h.i  a tomiiir.iil  i.-  i i , ' 

an  opinion  or  a„i,udo.  “si»la„;,  <" 

“■  (2), 
mind,  /h^yro  rx,.rrn„u  « grandchild.  (3)  Isn’t  tha^  pood  news 


(LITERARY  - WRITTEN  - SOURCE:  THE  LONG  CHRISTMAS  DINNER  ) 


what'frec^eVrnVIa^^^  of 

a-  a serreT  O information  would  probably  court 

:^o:i;:::"u;ti::c::L:“e:x!r;^^^  ----- 

A:  So.  anyway  when  we  got  there  ,hc  funniest  thinp  (1)  happened. 

my  .sat  c.-twn  and  they  passed  out  these  little  booklets  (2). 
because  we  went  to  thoir  suite. 


B:  Uh-huh. 

A:  And  they  started  prenchinp  about  their  retipion  the  whole  three 
hours  (2)  and  we  were  just  crawling  the  walls  to  get  out. 


(REAL  - ORAL  - SOURCE:  SLIND-DATES.PROTOCOL) 


.Voil  ; n»J  “ '=®i"e  a PSEXPR,  not  bocau-o  ,t  i- 

doc  a CPR  ...S,  bol  nooaooc  ,t  io  II, c oub.cct  of  a verb  which  taboo  ,o  oubiccio  no  , 
which  -'Pfplo -onto,  the  vnH,  -happen-,  TI„o  loado  to  a now  principle,  ' 

If  an  expression  is  the  subject  of  an  event-verb 
then  it  is  a PREXPR. 

(PD4.1) 

The  set  of  event-verbs  is  <happen,  occur,  ...> 

e aso  know  that  candidaleo  for  involvernenf  in  PREF  with  ouont-nounr  mo'l  i„ 

ncnoVaMaw“"Tr'';;'''‘V""“’'  -nn'o,  c“  pr.  v. 

encr.ll  law..  I hcsilato  Io  make  a rule  of  candidate  oelection  oyl  of  thi'  intuition 

' cn.on  ' dt-cl:  :r  -y  »'  ‘la'actini  when  a docla::,,":' 

rfunA  hm,  ii  '»'''“»*=>■  Wa  Inlet  that  ,t  io  not  a otramc 

pr  funny  thing  that  the  speaker  and  the  group  of  people  the  speaker  was  with  "startld 
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circumstances.  It  is  this  complex  ur.de-rstandinr 
I ' • f ° decide  that  the  last  part  of  the  third  turn  is  not  part  of  the  PREF  We 

a^o  m,u,t  lhat  the  last  clause  of  the  f„st  turn  ,c  not  part  of  the  PREF,  and  we  do  co  no 

?h  no  ioTZ  circumstances  described,  it  was  not  a strange 

“ fhe^peaker  to  go  to  their  suite,  but  also  because  of  our  understanding,  of  the 
semantic  function  of  the  clause  “because  we  went  to  their  suite"  There  are  two  dif tint 

Itle^es  illustrated  by  the  two  following 

(a)  John  is  not  coming  to  the  meeting  tonight, 

because  he  is  sick. 

(b)  John  is  not  coming  to  the  meeting  tonight, 

„ because  he  just  phoned  me  from  Australia. 

introduce  considerations 

which  either  logically  or  plausibly  juslify  making  a certain  statement,  as  in  (b).  In  the 

dlsSn^^°'' H a certain 

description,  and  is  thus  not  part  of  that  description  itself. 

Let  us  now  turn  to  an  examination  of  some  cases  of  PREF  which  involve  pronouns. 

Some  general  heuristics  may  be  laid  down  at  the  outset.  The  first  concerns  the 

dIoIo'cTo  refer  to  actions  and  pronouns  which  refer  to 

propositions.  Consider  the  following  example  (7); 

A:  And,  for  your  information.  Jack.  Pm  just  going  to  tear  into  tome  hoof 
and  gravy  and  olhor  nsxortod  goodio.K,  (1) 

B:  I presume  that  you’re  doing  this  (2)  with  the  full  permission  and  - of 
the  commander. 


(REAL  - ORAL  - SOURCE:  APOLLO-13/PAGE379.PROTOCOL) 


ere  we  intuit  that  the  pronoun  (2)  is  involved  in  what  might  be  called  a repeated 
reference  to  an  action.  The  principle  clue  is  that  (2)  is  the  object  of  the  pro-verb  "do", 
clue,  in  so  far  as  I have  been  able  to  ascertain,  is  a frequent  and  reliable  one.  It  is 


This 


not  the  only  clue,  however,  as  the  example  (S)  show: 


A:  Wo  have  decided  to  use  a canister  and,  you  know  that  the 
liquid-cooled  garment  has  a bag  around  it  that  we  think  we  can 
use  too.  Or  that  we  know  we  can  use.  We’ve  tried  it  (1). 


(REAL  - ORAL  - SOURCE:  APOLLO-13/PAGE379.PROTOCOL) 


Hero  we  intuit  that  (1)  is  involved  in  a repeated  action  reference,  although  ust  what  action 
IS  involved  is  rather  unclear.  The  clue  here  is  that  the  pronoun  (1)  is  the  object  of  the 
verb  try  . So  v/e  can  formulate  the  following  rather  solid  heuristic- 
(PD5.1) 


If  a pronominal  expression  is  the  object  of  either  of 
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the  verbs  "do"  or  "try”,  then  it  is  not  a PREXPR. 

Secondly,  we  know  that  there  are  certain  predicates  which  mark  pronouns  as 
PREXPRs.  Some  of  these  can  be  grouped  Into  two  classes:  the  class  of  v/hat  can  bo 
loosely  called  "logical"  predicates,  such  as  "x  is  true/false"  , "x  is  possible/probable",  "x  is 
inconsistent"  or  "x  implies  y",  and  what  can  be  called  intentional  predicates,  such  as  "x 
knows/believes  y". 

(PD6.1) 

If  an  expression  is  such  that  some  logical  predicate  is 
attributed  to  its  referent,  then  it  is  a PREXPR. 

Logical  predicates  include  the  adjectives  "true",  "false", 

"probable",  "possible",  and  the  verbs  “imply",  "entail". 

(PD7.1) 

If  an  expression  is  the  object  of  an  intentional  verb, 
then  it  is  a PREXPR.  Intentional  verbs  are  a class  of 
verbs  which  includes  "know",  "believe",  "remember", 

"wants". 

As  soon  as  one  thinks  about  PD7.1,  hov/ever,  one  realizes  that  it  is  insufficient.  Most 
intentional  verbs  [17]  can  take  exp'essions  as  objects  which  denote  not  propositions,  but 
objects.  Thus  we  have  not  only  "John  remembers  that  Mary  is  sick  and  Paul  remembers  it 
too",  but  also  "John  remembers  Gill’s  boat  and  t5aul  remembers  it  too".  An  example  of 
such  a use  of  "know"  is  the  following: 

A:  ...  And  they  started  driving  and  I don’t  know  if  you  know  San 
Gabriel  Valley  whore  Crystal  Lake  is? 

B:  I don’t  know  the  area  too  well,  my  dear. 


(REAL  - ORAL  - SOURCE;  E3LIND-DATES.PR0T0C0L) 


Hov/ever,  if  one  restricts  PD7  to  pronominal  expressions,  then  it  can  be  defended  on  the 
basis  of  a frequency  argument.  If  one  examines  the  occurences  of  the  verbs  "know"  and 
"believe"  followed  by  "it"  or  "that"  in  the  dialogues  v/hich  are  presently  on  line,  one  find  , 
that  the  great  majority  are  Instances  of  PREF  phenomena.  A case  by  case  study  o: 
intentional  verbs  is  required  here.  But  for  the  moment,  lot  us  amend  and  restrict  PD7: 
(PD7.2) 

If  a pronominal  expression  is  the  object  of  one  of  the 
two  intentional  verbs  "know"  and  "believe",  then  it  is 
a PREXPR. 

One  last  clue  that  we  can  propose  for  the  detection  of  pronominal  PREXPRs  is  that 
the  pronoun  "so",  when  it  is  the  object  of  a verb,  is  alv/ays  a PREXPR  (as  far  as  I have 
been  able  to  ciotorminc).  Examples  arc:  "is  John  sick?  - I think  so."  end  "I  hope  I hat  Mary 
passed  her  exam.  - I hope  so  too." 

(PD3.1) 

All  occurences  of  the  pronoun  "so"  as  the  objects  of 


[17]  On  intentional  verbs  and  their  logical  peculiarities,  see  the  appendix  on  intentionality. 
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verbs  other  than  the  pro-veib  "did"  are  PREXPRs. 

This  clue  is  infrequent  but  very  reliable. 

Let  us  now  turn  to  the  consideration  of  some  examples  of  PREF  which  involve 
pronouns.  Example  (9): 

A:  Well,  you  miglu  have,  saved  your  life  (1),  my  dear  Caty. 

B:  I know  that  (2). 

A:  Yeh,  you  mi/rhi  all  have  been  wiped  oul  in  a drunken  car  aeeideiti  if 
you  hadn’t  done  that.  (3) 

B:  I know  it  (4). 


(REAL  - ORAL  - SOURCE:  BLIND-DATES.PROTOCOL) 


Both  (2)  and  (4)  are  marked  as  PREXPRs  by  rule  PD7.2.  The  problem  of  candidcitc- 
detection  and  selection  now  arises.  We  perceive  (1)  to  be  co-referential  with  (2)  and  (3) 
to  be  co-referential  with  (4).  This  suggests  the  follov/ing  pair  of  blatantly  rudimentary 
rules  of  candidate  detection  (PCD)  and  candidate  selection  (PCS): 

(PCD  1.1) 

The  candidates  for  co-referentiality  with  a pronominal 
PREXPR  Is  the  set  of  all  sentences  and  nominalised 
sentences  at  a distance  of  m sentences  from  the 
PREXPR  in  question  (before  or  aft^r),  where  rn  is 
some  search  parameter. 

(PCSl.l) 

Select  the  first  preceding  sentence  or  norninalized 
sentence  as  being  co-referential  with  a pronominal 
PREXPR. 

These  two  rules  seem  to  work  in  our  next  example  (10): 

A:  Very  briefly,  I’ve  had  a lot  of  pain  for  six  v/eeks  and  diagnosed  more 
or  less  as  a dislocated  disc.  Now,  what's  your  feelings?  In  the 
first  place,  they  said  cancer  - maybe.  So,  I had  a lot  of  tests 
done,  but  now,  since  I changed  doctors,  he  says  there  is,  well, 
there’s  this  hlood  test  ealled  C.l'.A,  uihic.h  will  tell  if  there  is 
eaneer  anywhere  in  your  system  (1).  And  I could  hardly  believe 
II  (2).  Now,  is  there  such  a test? 


(REAL  - ORAL  - SOURCE.  K/.EDICAL-CENTERED.PROTOCOL) 


And  again  in  the  following  examples,  drawn  from  the  same  source  and  v/hich  v/e  will  bring 
together  as  example  (11): 


1 
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A:  My  wife  knows  I wear  my  cap  She  way  I like  to.  And  I know  what’s 
Sjood  for  my  wife,  as  well  as  for  everybody  else.  / /.ee/)  my 
ItHsiiwss  In  mysolf,  wiilmil  nny  urod  nf  thosr  who  wear  foniUrtra 
ill  tlinir  caiis.  (1)  And  everybody  in  these  parts  knows  it  (2), 
thank  the  Lord! 


A:  Urr  lin.diniid  arrived  only  tliix  iiioniinir.  (1) 
B:  Oh,  oh,  you  know  that  (2)  too?  Bravo! 


A:  Your're  out  of  your  mind!  (1) 

B;  Yes,  it's  (2)  true!  I'm  out  of  my  mind! 


A:  I’m  soing  home  because  my  husband’s  on  my  mind.  / didn’t  see  him 
in  r/iurr/i.  (1) 

B:  Don’t  think  of  that  (2).  He’ll  be  along  to  the  square. 


(LITERARY  - WRITTEN  - SOURCE:  CAVALLERIA  RUSTICANA) 


These  rules  v/ill  also  operate  satisfactorily  on  the  follo'wing  exarnpl3  (11)  if  supplemented 
by  the  general  principles  for  pronoun  resolution  GRl  and  GR2  set  forth  above  in  section 
II.  1: 

A:  I heard  them  say  that  she  had  been  a loose  u.oinnn  (1)!  I don’t  want 
to  believe  it  (2)  - I still  don't  believe  it  (3)  - but  I can’t  help 
feeling  that  it  (4)  is  true.  Everything  points  to  it  (5)  - and  I feel 
ashamed,  mortified!  Ashamed  to  shov/  myself  in  her  company. 

Everybody  seems  to  be  staring  at  us  - I seem  to  feel  the  men 
ogling  us!  It’s  (6)  frightful!  But  can  it  (7)  really  be  true?  Do  you 
think  it  (8)  can  be  true?  Tell  me! 


(LITERARY  - WRITTEN  - SOURCE:  MOTHERLOVE) 


If  the  search  parameter  m of  the  general  rules  is  sufficiently  large,  the  expressions  (2}-^£; 
v/ould  be  determined  to  be  co-referentia!  with  (1).  Some  disatir.faction  might  be  felt  v/iin 
this  result  in  regard  to  (6),  however,  since  what  is  said  to  be  frightful  is  perhaps  the 
speaker’s  shame  and  embarassrriert  rather  than  (or  perhaps  as  v/ell  as)  the  purported  fact 
that  she  (th.e  mother)  had  been  a loose  woman.  Intuitions  are  not  very  clear  on  this  point, 
and  the  question  can  be  answered  either  way  v/ith  little  impact  on  the  dialogue  analysis  in 
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this  particular  case.  The  problem,  though  not  very  serious,  signals  a fact  that  we  v/ill 
return  to  below,  viz.  the  fact  that  pronominal  PREXPRs  which  have  altitudinal  adjectives 
predicated  of  them  are  more  difficult  to  select  candidates  for  than  pronouns  which  have 
say,  logical  predicates. 

For  the  moment,  let  us  comment  furthur  on  PCS  1.1  It  is  unciear  just  what  we  mean 
by  sentence  in  that  rule:  do  we  moan  literally  a string  of  words  ending  with  a period,  or 
a sentential  constituent,  of  which  there  may  be  several  in  a sentence  in  the  literal  sense? 
The  latter  interpretation  seems  required  by  examples  such  as  the  following  (13): 


A.  Sorry  to  bother  you,  but  xonironn  snnms  to  have  chniiftcd  ono  of  our 
/}nsnu;ord.<i  (1)  and  no  one  here  knows  anything  about  it  (2). 


(REAL  - WRITTEN  - SOURCE:  OC133.PROTOCCL) 


and  the  following  example  (14); 

A:  Fred,  in  nhout  ■/•  luinnto.^,  trr’ro  poin/r  to  hand  yon  over  to  u different 
communii'otions  site,  and  iV.<  poinp  to  ttiho  us  nhont  n luiimto  or 
so  to  rr-ostnblish  uidink  (1),  so  you  can  be  prepared  for  that  (2), 


(REAL  - ORAL  - SOURCE:  APOLLO- 13/PAGE  379.PROTOCOL) 

Secondly,  we  must  ask  how  PCSl.l  fares  when  the  preceding  sentence  is  in  ♦hio 
interrogative  mood.  There  are  some  cases  involving  the  PREXPR  "so"  in  which  that 
pronoun  is  involved  in  a PREF  wilh  the  declarative  transform  of  the  preceding 
interrogative  sentence:  "Is  Mary  sick?  - I believe/think  so."  Such  cases  only  occur  when 
the  preceding  question  is  not  of  the  WH-type:  consider  the  absurdity  of  "Who  is  the 
President?  - I believe  so."  Apart  from  the  special  "so"  cases,  there  seem  to  be  two  olhnr 
kinds  of  cases,  illustrated  by  (a)  and  (b): 

(a)  Is  Mary  sick? 

I don’t  know  that. 

How  many  feet  are  there  in  a meter? 

Oh,  I learned  that  in  school. 

(b)  Is  Mary  sick? 

What  makes  you  ask  that? 

How  many  feet  are  there  in  a meter? 

I often  wonder  about  that  myself. 

In  cases  of  type  (a),  the  pronoun  •■'nds  in  for  an  answer  to  the  preceding  question, 
whereas  in  cases  of  type  (b),  the  pronoun  stands  in  for  the  preceding  question  itself.  If  is 
difficult  to  find  a principle  which  would  distinguish  between  the  fv/o  cases.  One  possible 
solution  would  focus  upon  the  role  which  the  verbs  in  the  verb-phrases  containing  the 
pronouns  usually  play  with  regard  to  presuppositions.  This  is  a problem  I hope  to  do 
more  work  onj  at  present  I can  only  pose  it. 


A:  Just  what  are  you  trying  to  tell  me,  young  lady? 

IIZ  7 ■ "”'•  “> »''»«“  1‘Z 

^ r^r  «W  gain  ,.  co^mnon,  or  to  l.rromo  onboard  to  and  marry 
y iiff  man  if  a ffood^  fps/wetahln  family,.,  (2) 


u will  give 


(LITERARY  - WRITTEN  - SOURCE:  MOTHERLOVE) 


PCSl  should  therefore  be  modified  so 
pronorn:nalization,  at  least  by  treating  “this" 
a forward  direction.  ’ ’ 


as  to  allow  for  forwaro  propositional 
as  opposed  to  "it"  or  "that",  as  a cue  for  such 


orereH'*''^'^  V according  to  whic 

(17)  below:'''  clause  woul 


one  should  select  the 
fail,  such  a-,  example 


A:  n.  „„„  i.  „„ 

B;  L.V.N,  ...  That’s  lanky  vertiginous  nurse? 


A:  Right. 


B;  I think  that  (2)  ’s  terrific. 
29th  anniversary?  • 


Liston,  what  are  you  getting  Sam  for  your 


(REAL 


ORAL  - SOURCE:  BLIND-DATES.PROTOCOL) 


Such  a case  might  be  handled  by  modifyino  PCSl  -o  that  thnro  r j . 

involved  in  "correction-ac  en^"  fir  i " candidates  which  were 

would  be  difficult  rope  ti^aiiz  bi;  owe 

pre«„t  locu:  on  L p'loern  j ’ T ““  '-'"Ah  tho 

rolol,vo;riLu!rct.«,°7.oul/;^^^^ 

unmanageable  cases.  ^ uii»-cultie.i  founcJ  in  more 

oboo,  lv:n"rL:7r„ra«  :“,'r 

ouch  00  CO, no, on,;  oho^'l^.'h/^rn  "o  To  e';’  ".t,"":':'!';'  ■ T7""- 

aboo,  Ph,cica,  objocs.  An^  who^n  cuc^  :o7J„,roc7l:r,h7;'l°:7'toX7c:7; 


any  Clear  way  of  determinins  that  wc  arc  faced  with 
-ttitudinal  comments.  Consider  example  (18): 


a rKtXPK. 


inis  IS  especially  true 


A:  One  is  out  in  the 
lltinffs  out  thcr<f 

B:  Hey,  that  (2)  ’s  neat. 


Sarage.  Sho’s  n irnchnr,  and  xhe’s  got  all  hr.r 
am  sltr'x  got  hor  littln  radio  tlwro.  (1) 

What’s  her  first  name? 


(REAL  - ORAL  - SOURCE:  BLIND-DATES.PROTOCOL) 


"t " muchiori'. 

co-reforential  with  all  of  (1),  rather  than^wm?"""^"  "i°  probably 

relies  on  one’s  knowledse  of  the  “ifuation  in  P‘'"'ticular,  on 

situation  in  which  the  participants  arrlot  i^tre  f ! ^O'^'^^^ication  is  taking  place,  n 
example  (19);  location.  Let  us  consider  another 


B:  Okay,  IM-milo  porigro  mm.  (2)  Tim  (3)  ’s  very  nic 


(REAL 


ORAL 


CORRECTION-ACTIONS-CORPUS.PROTOCOL) 


SOURCE: 


H^wovoi',  'n!  Sh^‘‘diS'’to’f  'V?  <21  »"«  therofore  with  (1) 

tu.  Which  ^drxttdti  •“  ^ 

probably  rnorr  '-equently  attributerl  fn  nhi  ♦ "n,  1 predicates  are 

one  is  left  with  routines  w^h  ^ And  so 

the  noun-phrase  "136-mile  perigee".  reference,  co-referential  with 

have  what  o‘ne°'l^iohrclinn^^^  P''Onommal  PREXPRs  often 

but  indeterminate  number  are  co-referential  with  a largo 

consider  the  expressions  (l)-(6)  in  the  Inn  ti"  ^ P''°POsitions.  As  an  example  of  this 
scope  in  differing  degree  and  - 

(e.g.  (D):  ’ propositional  pronorninalization 


C 

...your  worst  experience  on  a blind  date 
name  is  ... 


- especially  if  your  first 


T 

Sharon 


propositional  REFERENCE:  CONCEPTS  AND  DETECTION 
DE  ECTION  OF  TEXT  AND  REPREATED  PROPOSITIONAL  REFERENCE 


C 

Sharon,  how  old  are  you? 
T 

I’m  21. 


C 

O.K.,  are  you  blond  or  brunette? 

T 

I’m  a brunette. 

C 

O.K.  Tell  me  about  your  worst  experience  on  a blind  date  mv 


T 

Well,  my  v/orst  experience  happened  when  I was  uh,  well  I was  in 
college,  I was  going  to  a girl’s  school. 

C 

uh,  huh  ... 


T 


...and  they  used  to  have  all  these 
in  ind  entertain  us. 


singing  groups,  you  know,  come 


C 

Do  you  mean  singing  groups  - professionals,  or  do  you  mean  from 
fraternities? 


T 

No,  these  were  professionals  singing. 

C 

Oh,  yeh.  Like  the  Four  Freshmen  and  people  like  that? 
T 

Veh,  well,  I ... 

C 

Like  the  Four  Preps. 

T 

I m not  going  to  name  the  group. 

C 

Yeh. 


T 

Because  /'/  (I)  was  really  strange,  because  there  were 


PROPOSITIONAL  REFERENCE:  CONCEPTS  AND  DETECTION 
DETECTION  OF  TEXT  AND  REPREATED  PROPOSITIONAL  REFERENCE 


about  3 of  U5  and  one  girl  know  the  group.  So,  you  know  she  asked 
would  we  like  to  go  on  this  date  with  this  singing  group?  We 
thought,  Oh  Wow,  Yes! 

C 

You  mean,  two  girls  go  out  with  four  guys? 

T 

No,  there  were  four  guys  but  there  would  be  A of  us. 

C 

Oh,  I see. 


T 

So,  I got  2 other  friends  and  you  know,  we  thought  77/ /S  (2) 
was  really  going  to  be  fantastic.  We  were  going  "to  get  drunk 
and  have  a great  big  dinner  and  really  have  a ball. 

C 

Uh! 


And  so,  uh,  they  had  a limousine  to  come  pick  us  up  and  it  took 
US  to  the  motel  and  we  started,  you  know,  giving  each  other  the 
eye  and  getting  kind  of  nervous. 


C 

They  brought  you  right  over  to  the  motel.  They  figured  you  were 
going  to  sign  up  as  "groupies". 


T 

Exactly,  I think.  That’s  what  we  thought,  at  least. 

C 

And,  no  dinner? 

T 

And  no  dinner,  yes! 

C 

Wow! 

T 

So,  anyway  when  we  got  there  the  funniest  thing  happened.  They 
sat  down  and  they  passed  out  these  little  booklets,  because  v/e 
wont  to  their  su'te. 

C 

Uh  huh. 


T 
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And,  they  started  preaching  about  their  religion  the  whole  three 
hours  and  we  were  just  crawling  the  walls  to  get  out. 

C 

If  was  a religious  frenzy? 

T 

Yeh! 

C 

Were  they  sifting  around  in  their  Saffron  robes? 

T 

Exactly! 

C 

And  their  little  Chinese  Terriple  gongs? 

T 

Right! 

C 

What  a weird,  what  a weird,  wow! 

T 

And,  we  said,  well  can  we  order  a drink?  And,  they  said.  Oh, 

'I  H/]Vs  (2)  not  the  way  of  our  religion  - we  can’t  drink. 


C 

Oh.... 

T 

/'/  (3)  v/as  really  the  most  boring  time  as  compared  to 
v/haf  we  were  expecting.  I think  it  would  have  been  more  fun  to  oq 
thrashing  the  hotel  room.  “ 

C 

Why  of  course!  How  long  did  you  girls  hand  around  with  these 
religious  freaks? 

T 

Well,  we  hung  around  ...  I guess  we  left  around  11  and  I guess 
we  got  back  around  3 in  the  morning. 

C 

Oh  wow,  you  stayed  too  long,  honey. 

T 

Yeh,  but  we  were  trapped  actually. 
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C 

Yeh,  well  did  THAT  (i)  turn  you  against  Saffron  robes 
permanently? 

T 

Oh  well,  no,  not  really  but  I just  wasn’t  expecting  ii 
(5)  at  that  time. 

C 

Yeh.  THAT  (6)  a dandy.  I’m  delighted  you  called, 

Sharon.  You  really  surprised  me.  I thought  I was  going  to  have  to 
bicep  you  out. 

T 

No.  I called  you  twice  before.  Bill. 

In  conclusion,  we  may  say  that  PREF  detection  is  considerably  more  difficult  than  is  TREF 
detection.  It  is  hoped  that  the  few  preliminary  approachs  presented  above  are  useful  if 
only  bocauso  ousgcstivc. 
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